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Please enter this Preliminary Amendment into the record of the above-identified patent 
application before the calculation of fees. 

In the Claims: 

Please delete the claims of the application as filed in the PCT and substitute therefor: 

27. An isolated polypeptide comprising a member selected from the group consisting of 

(a) an amino acid sequence which has at least 90% identity to SEQ ID NOs:2 or 4; 

(b) an immunogenic fragment of the amino acid sequence of (a), wherein the 
immunogenic fragment is at least 90% identical to an aligned contiguous 
segment of SEQ ID NOs:2 or 4; and 

(c) an immunogenic fragment of the amino acid sequence of (a) that matches an 
aligned contiguous segment of SEQ ID NOs:2 or 4with no more than five single 
amino acid substitutions, deletions or additions; 

wherein the isolated polypeptide, when administered to a subject in a suitable composition 
which can include an adjuvant, or a suitable carrier coupled to the polypeptide, induces an 
immune response that recognizes a polypeptide having the sequence of SEQ ID NOs:2 or 4. 

28. The isolated polypeptide of claim 27, wherein the immune response recognizes an 
antigen having molecular weight 95 kD (by SDS PAGE) in Moraxella catarrhalis strains Mc 
2904, 2913, 2926, 293 1, 2956, 2960, 2969, and 2975. 

29. The isolated polypeptide of claim 27, wherein the immunogenic fragment of (b) 
comprises at least 15 amino acids. 
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30. The isolated polypeptide of claim 29, wherein the immunogenic fragment matches an 
aligned contiguous segment of SEQ ID NOs:2 or 4 with no more than a single amino acid 
substitution, deletion or addition. 

3 1 . The isolated polypeptide of claim 27, wherein the immunogenic fragment of (b) 
comprises at least 20 amino acids. 

32. The isolated polypeptide of Claim 27 wherein the amino acid sequence of (a) has at 
least 95% identity to SEQ ID NOs:2 or 4. 

33. The isolated polypeptide of Claim 32 wherein the isolated polypeptide comprises the 
amino acid sequence of SEQ ID NOs:2 or 4. 

34. The isolated polypeptide of claim 32 wherein the isolated polypeptide consists of the 
amino acid sequence of SEQ ID NOs:2 or 4. 

35. A fusion protein comprising the isolated polypeptide of Claim 27. 

36. The isolated polypeptide of Claim 27 wherein the polypeptide is the immunogenic 
fragment having no more than two single amino acid substitutions, deletions or additions 
relative to the aligned sequence. 

37. The isolated polypeptide of Claim 27 wherein the polypeptide is the immunogenic 
fragment having no more than one single amino acid substitution, deletion or addition relative 
to the aligned sequence. 

38. The isolated polypeptide of Claim 27 wherein the polypeptide is the immunogenic 
fragment which matches the aligned sequence. 

39. An isolated polypeptide encoded by an isolated first polynucleotide wherein the 
isolated first polynucleotide hybridizes under stringent conditions to a second polynucleotide 
which encodes the polypeptide of SEQ ID NOs:2 or 4; wherein stringent conditions comprise 
overnight incubation at 42° C in a solution comprising: 50% formamide, 5 X SSC (150 mM 
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NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's solution, 
10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon sperm DNA, followed 
by washing the filters in 0. 1 x SSC at about 65° C; wherein the isolated polypeptide, when 
administered to a subject in a suitable composition which can include an adjuvant, or a suitable 
carrier coupled to the polypeptide, induces an immune response that recognizes a polypeptide 
having the sequence of SEQ ID NOs:2 or 4. 

40. An isolated polynucleotide encoding a polypeptide of Claim 27 or the full complement 
to the isolated polynucleotide. 

41. An isolated polynucleotide encoding a polypeptide of Claim 27, wherein the isolated 
polynucleotide encodes the polypeptide comprising SEQ ID NOs:2 or 4. 

42. An isolated polynucleotide comprising the polynucleotide of SEQ ID NOs: 1 or 3. 

43 . An isolated polynucleotide segment comprising a polynucleotide sequence or the full 
complement of the entire length of the polynucleotide sequence, wherein the polynucleotide 
sequence hybridizes to the full complement of SEQ ID NOs: 1 or 3 minus the complement of 
any stop codon, wherein the hybridization conditions include incubation at 42°C in a solution 
comprising: 50% formamide, 5x SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium 
phosphate (pH7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml 
denatured, sheared salmon sperm DNA, followed by washing in O.lx SSC at 65°C; and, 
wherein the polynucleotide sequence is identical to SEQ ID NOs:l or 3 minus any terminal 
stop codon, except that, over the entire length corresponding to SEQ ID NOs:l or 3 minus any 
terminal stop codon, n n nucleotides are substituted, inserted or deleted, wherein n n satisfies the 
following expression 

n n < x n - (x n • y) 

wherein x n is the total number of nucleotides in SEQ ID NOs:l or 3 minus any terminal stop 
codon, y is at least 0.95, and wherein any non-integer product of x n and y is rounded down to 
the nearest integer before subtracting the product from x n ; and wherein the polynucleotide 
sequence detects Moraxella catarrhalis. 



44. 



An expression vector comprising the isolated polynucleotide of Claim 40. 
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45 . A host cell transformed with the expression vector of Claim 44. 

46. A process of producing an isolated polypeptide comprising (a) culturing the host cell of 
Claim 45 under conditions sufficient for the production of the encoded polypeptide and (b) 
recovering the polypeptide. 

47. A nucleic acid vaccine comprising the isolated polynucleotide of Claim 40 and a 
pharmaceutically acceptable carrier. 

48. An isolated polynucleotide segment comprising a polynucleotide sequence or the full 
complement of the entire length of the polynucleotide sequence, wherein the polynucleotide 
sequence is identical to SEQ ID NOs: 1 or 3 minus any terminal stop codon, except that, over 
the entire length corresponding to SEQ ID NOs: 1 or 3 minus any terminal stop codon, n n 
nucleotides are substituted, inserted or deleted, wherein n n satisfies the following expression 

n n < x n - (x n • y) 

wherein x n is the total number of nucleotides in SEQ ID NOs: 1 or 3 minus any terminal stop 
codon ? y is at least 0.90, and wherein any non-integer product of x n and y is rounded down to 
the nearest integer before subtracting the product from x n ; and wherein the polynucleotide 
sequence detects Moraxella catarrhalis. 

49. The isolated polynucleotide of Claim 48 where y is at least 0.95. 

50. An expression vector comprising the isolated polynucleotide of Claim 48 which codes 
for a polypeptide that, when administered to a mammal which can include an adjuvant, or a 
suitable carrier coupled to the polypeptide, induces an immune response that recognizes a 
polypeptide having the sequence of SEQ ID NOs:2 or 4. 

51. A host cell transformed with the isolated polynucleotide or an expression vector 
comprising the isolated polynucleotide of Claim 48. 
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52. A process of producing an isolated polypeptide comprising (a) culturing the host cell of 
Claim 5 1 under conditions sufficient for the production of the encoded polypeptide and (b) 
recovering the polypeptide. 

53. A vaccine comprising the polypeptide of Claim 27 and a pharmaceutically acceptable 
carrier. 

54. The vaccine of Claim 53, wherein the composition comprises at least one other 
Moraxella catarrhalis antigen. 

55. An antibody immunospecific for the polypeptide or immunogenic fragment of Claim 
27. 

56. A method for inducing an immune response in a mammal comprising administration of 
the polypeptide of Claim 27. 

57. A method of diagnosing & Moraxella catarrhalis infection, comprising identifying a 
polypeptide of Claim 27, or an antibody that is immunospecific for the polypeptide, present 
within a biological sample from an animal suspected of having such an infection. 

58. A method for inducing an immune response in a mammal comprising administration of 
the isolated polynucleotide of Claim 40. 

59. A therapeutic composition useful in treating humans with Moraxella catarrhalis 
comprising at least one antibody directed against the polypeptide of claim 27 and a suitable 
pharmaceutical carrier. 

60. A process for expressing the polynucleotide of Claim 40 comprising transforming a 
host cell with the expression vector comprising the polynucleotide and culturing the host cell 
under conditions sufficient for expression of the polynucleotide. 
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REMARKS 

Applicant respectfully requests that this Preliminary Amendment be entered in this case 
before the calculation of fees and before examination of the subject application. 

Claims 

Claims 1-26 have been canceled without prejudice or disclaimer of the subject matter 
therein. Applicant reserves the right to prosecute, in one or more patent applications, the 
canceled claims, the claims to non-elected inventions, the claims as originally filed, and any 
other claims supported by the specification. 

New claims 27-60 have been introduced. No new matter is added. 

Support 

Support for the new claims is either obvious, or is as described in the text below. 
Particularly, support for the recitation of "five single amino acid substitutions, deletions or 
additions" may be found, for example, at page 7, lines 1-2. Support for compositions of the 
isolated polypeptide which include an adjuvant recited in the claims may be found, for 
example, at page 40, lines 28-29. Support for the stringent hybridization conditions may be 
found, for example, at page 1 4, lines 20-24. Support for the recitation of sequence relatedness 
such as "wherein the first polynucleotide sequence is identical to SEQ ID NOs: 1 or 3, except 
that, over the entire length corresponding to SEQ ID NOs: 1 or 3, n n nucleotides are substituted, 
inserted or deleted, wherein n n satisfies the following expression 

n n < x n - (x n • y) 

wherein x n is the total number of nucleotides in SEQ ID NOs: 1 or 3, y is at least 0.90, and 
wherein any non-integer product of x n and y is rounded down to the nearest integer before 
subtracting the product from x n " may be found in the specification, for example, at page 44, 
line 7 through page 45, line 2. Support for the recitation, "wherein the immune response 
recognizes an antigen having molecular weight 95 kD (by SDS PAGE) in Moraxella 
catarrhalis strains Mc 293 1, 2904, 2905, 2906, 2907, 2908, 2909, 2910 and 291 1" can be 
found, for example, in Example 6 at page 63, lines 4 and 5 and Figure 7. 
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Closing Remarks 

Entry of this Preliminary Amendment prior to calculation of fees and Examination and 
Allowance of the pending claims are respectfully requested 



Respectfully submitted, 




DECHERT PRICE & RHOADS 
Princeton Pike Corporate Center 
PO Box 5218 

Princeton, New Jersey 08543-5218 
Arthur E. Jackson (609) 620-3254 
Fax: (609) 620-3259 
Attn: Arthur E. Jackson, Esq. 
(609) 620-3254 
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BASB027 PROTEINS AND GENES FROM MORAXELLA CA TARRHALIS, ANTIGENS, ANTIBODIES, AND USES 

FIELD OF THE INVENTION 

This invention relates to polynucleotides, (herein referred to as " BASB027 
polynucleotide^)"), polypeptides encoded by them (referred to herein as ~BASB02r* or 
"BASB027 polypeptide^)"), recombinant materials and methods for their production. In 
another aspect, the invention relates to methods for using such polypeptides and 
polynucleotides, including vaccines against bacterial infections. In a further aspect, the 
invention relates to diagnostic assays for detecting infection of certain pathogens. 

BACKGROUND OF THE INVENTION 

Moraxella catarrhal is (also named Branhamella catarrhalis) is a Gram negative bacteria 
frequently isolated from the human upper respiratory tract. It is responsible for several 
pathologies the main ones being otitis media in infants and children, and pneumonia in 
elderlies. It is also responsible of sinusitis, nosocomial infections and less frequently of 
invasive diseases. 



Otitis media is an important childhood disease both by the number of cases and its potential 
sequelae. More than 3.5 millions cases are recorded every year in the United States, and it is 
estimated that 80 % of the children have experienced at least one episode of otitis before 
reaching the age of 3 (Klein, JO (1994) ClinlnfDis 19:823). Left untreated, or becoming 
chronic, this disease may lead to hearing losses that could be temporary (in the case of fluid 
accumulation in the middle ear) or permanent (if the auditive nerve is damaged). In infants, 
such hearing losses may be responsible for a delayed speech learning. 

Three bacterial species are primarily isolated from the middle ear of children with otitis 
media: Streptococcus pneumoniae, non typeable Haemophilus influenza (NTHi) and M 
catarrhalis. They are present in 60 to 90 % of the cases. A review of recent studies shows 
that S. pneumoniae and NTHi represent both about 30 %, and M. catarrhalis about 15 % of 
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the otitis media cases (Murphy, TF (1996) MicrobioLRev, 60:267). Other bacteria could be 
isolated from the middle ear (H influenza type B, S. pyogenes etc) but at a much lower 
frequency (2 % of the cases or less). 

Epidemiological data indicate that, for the pathogens found in the middle ear, the 
colonization of the upper respiratory tract is an absolute prerequisite for the development of 
an otitis; other are however also required to lead to the disease (Dickinson, DP et al. (1 988) 
J. InfectDis. 158:205, Faden, HL et al. (1991) Ann.OtorhinoLLaryngol. 100:612). These are 
important to trigger the migration of the bacteria into the middle ear via the Eustachian 
tubes, followed by the initiation of an inflammatory process. These factors are unknown 
todate. It has been postulated that a transient anomaly of the immune system following a 
viral infection, for example, could cause an inability to control the colonization of the 
respiratory tract (Faden, HL et al (1994) J. InfectDis. 169:1312). An alternative explanation 
is that the exposure to environmental factors allow a more important colonization of some 
children, who subsequently become susceptible to the development of otitis media because 
of the sustained presence of middle ear pathogens (Murphy, TF (1996) Microbiol.Rev. 
60:267). 

The immune response to M catarrhalis is poorly characterized. The analysis of strains 
isolated sequentially from the nasopharynx of babies followed from 0 to 2 years of age, 
indicates that they get and eliminate frequently new strains. This indicates that an 
efficacious immune response against this bacteria is mounted by the colonized children 
(Faden, HL et al (1994) J. InfectDis. 169:1312). 

In most adults tested, bactericidal antibodies have been identified (Chapman, AJ et al. 
(1985) J. InfectDis. 151:878). Strains of M catarrhalis present variations in their capacity 
to resist serum bactericidal activity: in general, isolates from diseased individuals are more 
resistant than those who are simply colonized (Hoi, C et ah (1993) Lancet 341:1281, Jordan, 
KL et al. (1990) Am. J.Med. 88 (suppl. 5A):28S). Serum resistance could therfore be 
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considered as a virulence factor of the bacteria. An opsonizing activity has been observed in 
the sera of children recovering from otitis media. 

The antigens targetted by these different immune responses in humans have not been 
identified, with the exception of OMP B 1 , a 84 kDa protein which expression is regulated 
by iron, and that is recognized by the sera of patients with pneumonia (Sethi, S, et al. (1995) 
Infectimmun. 63: 1516) , and of UspAl and UspA2 (Chen D. et al(1999), Infect.Immun. 
67:1310). 

A few other membrane proteins present on the surface of M catarrhalis have been 
characterized using biochemical method, or for their potential implication in the induction of 
a protective immunity (for review, see Murphy, TF (1996) Microbiol.Rev. 60:267). In a 
mouse pneumonia model, the presence of antibodies raised against some of them (UspA, 
CopB) favors a faster clearance of the pulmonary infection. Another polypeptide (OMP CD) 
is highly conserved among M. catarrhalis strains, and presents homologies with a porin of 
Pseudomonas aeruginosa, which has been demonstrated efficacious against this bacterium 
in animal models. 

The frequency of Moraxella catarrhalis infections has risen dramatically in the past few 
decades. This has been attributed to the emergence of multiply antibiotic resistant strains 
and an increasing population of people with weakened immune systems. It is no longer 
uncommon to isolate Moraxella catarrhalis strains that are resistant to some or all of the 
standard antibiotics. This phenomenon has created an unmet medical need and demand for 
new anti-microbial agents, vaccines, drug screening methods, and diagnostic tests for this 
organism. 



SUMMARY OF THE INVENTION 



3 



WO 99/63093 



PCT/EP99/03822 



The present invention relates to BASB027, in particular BASB027 polypeptides and 
B ASB027 polynucleotides, recombinant materials and methods for their production. In 
another aspect, the invention relates to methods for using such polypeptides and 
polynucleotides, including prevention and treatment of microbial diseases, amongst others. 
In a further aspect, the invention relates to diagnostic assays for detecting diseases 
associated with microbial infections and conditions associated with such infections, such 
as assays for detecting expression or activity of BASB027 polynucleotides or 
polypeptides. 

Various changes and modifications within the spirit and scope of the disclosed invention 
will become readily apparent to those skilled in the art from reading the following 
descriptions and from reading the other parts of the present disclosure. 

DESCRIPTION OF THE INVENTION 

The invention relates to BASB027 polypeptides and polynucleotides as described in greater 
detail below. In particular, the invention relates to polypeptides and polynucleotides of 
BASB027 of Moraxella catarrhalis, which is related by amino acid sequence homology to 
Neisseria meningitidis OMP85 outer membrane protein. The invention relates especially to 
BASB027 having the nucleotide and amino acid sequences set out in SEQ ID NO: 1 or 3 and 
SEQ ID NO:2 or 4 respectively. It is understood that sequences recited in the Sequence 
Listing below as "DNA" represent an exemplification of one embodiment of the 
invention, since those of ordinary skill will recognize that such sequences can be usefully 
employed in polynucleotides in general, including ribopolynucleotides. 

Polypeptides 

In one aspect of the invention there are provided polypeptides of Moraxella catarrhalis 
referred to herein as "B ASB027" and "BASB027 polypeptides" as well as biologically, 
diagnostically, prophylactically, clinically or therapeutically useful variants thereof, and 
compositions comprising the same. 
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The present invention further provides for: 

(a) an isolated polypeptide which comprises an amino acid sequence which has at least 
85% identity, preferably at least 90% identity, more preferably at least 95% identity, most 
preferably at least 97-99% or exact identity, to that of SEQ ID NO:2 or 4; 

(b) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence which has at least 85% identity, preferably at least 90% identity, more 
preferably at least 95% identity, even more preferably at least 97-99% or exact identity to 
SEQ ID NO: 1 or 3 over the entire length of SEQ ID NO: 1 or 3 respectively; or 

(c) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence encoding a polypeptide which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity, to the amino acid sequence of SEQ ID NO:2 or 4. 

The BASB027 polypeptides provided in SEQ ID NO:2 or 4 are the BASB027 
polypeptides from Moraxella catarrhalis strain Mc293 1 (ATCC 43617). 

The invention also provides an immunogenic fragment of a BASB027 polypeptide, that 
is, a contiguous portion of the BASB027 polypeptide which has the same or substantially 
the same immunogenic activity as the polypeptide comprising the amino acid sequence of 
SEQ ID NO:2 or 4; That is to say, the fragment (if necessary when coupled to a carrier) is 
capable of raising an immune response which recognises the BASB027 polypeptide. 
Such an immunogenic fragment may include, for example, the BASB027 polypeptide 
lacking an N-terminal leader sequence, and/or a transmembrane domain and/or a C- 
terminal anchor domain. In a preferred aspect the immunogenic fragment of BASB027 
according to the invention comprises substantially all of the extracellular domain of a 
polypeptide which has at least 85% identity, preferably at least 90% identity, more 
preferably at least 95% identity, most preferably at least 97-99% identity, to that of SEQ 
ID NO:2 or 4 over the entire length of SEQ ID NO:2 
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A fragment is a polypeptide having an amino acid sequence that is entirely the same as part 
but not all of any amino acid sequence of any polypeptide of the invention. As with 
BASB027 polypeptides, fragments may be " free-standing,' 1 or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a single continuous 
region in a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of an 
amino acid sequence of SEQ ID NO:2 or 4 or of variants thereof, such as a continuous series 
of residues that includes an amino- and/or carboxyl-terminal amino acid sequence. 
Degradation forms of the polypeptides of the invention produced by or in a host cell are 
also preferred. Further preferred are fragments characterized by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, 
beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, and 
high antigenic index regions. 

Further preferred fragments include an isolated polypeptide comprising an amino acid 
sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids from the 
amino acid sequence of SEQ ID NO:2 or 4, or an isolated polypeptide comprising an 
amino acid sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids 
truncated or deleted from the amino acid sequence of SEQ ID NO: 2 or 4. 

Fragments of the polypeptides of the invention may be employed for producing the 
corresponding full-length polypeptide by peptide synthesis; therefore, these fragments 
may be employed as intermediates for producing the full-length polypeptides of the 
invention. 



6 



WO 99/63093 



PC77EP99/03822 



Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids 
are substituted, deleted, or added in any combination. 

The polypeptides, or immunogenic fragments, of the invention may be in the form of 
the '"mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
during recombinant production. Furthermore, addition of exogenous polypeptide or 
lipid tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect, the invention relates to genetically engineered soluble fusion proteins 
comprising a polypeptide of the present invention, or a fragment thereof, and various 
portions of the constant regions of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgGl, where fusion takes place at the hinge 
region. In a particular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. 

Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. W094/29458 and W094/22914. 

The proteins may be chemically conjugated, or expressed as recombinant fusion 
proteins allowing increased levels to be produced in an expression system as compared 
to non-fused protein. The fusion partner may assist in providing T helper epitopes 
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(immunological fusion partner), preferably T helper epitopes recognised by humans, or 
assist in expressing the protein (expression enhancer) at higher yields than the native 
recombinant protein. Preferably the fusion partner will be both an immunological 
fusion partner and expression enhancing partner. 

Fusion partners include protein D from Haemophilus influenzae, and the non-structural 
protein from influenzae virus, NS1 (hemagglutinin). Another fusion partner is the 
protein known as LytA. Preferably the C terminal portion of the molecule is used. Lyta 
is derived from Streptococcus pneumoniae which synthesize an N-acetyl-L-aianine 
amidase, amidase LytA, (coded by the lytA gene {Gene, 43 (1986) page 265-272}) an 
autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The 
C-terminal domain of the LytA protein is responsible for the affinity to the choline or to 
some choline analogues such as DEAE, This property has been exploited for the 
development of E.coli C-LytA expressing plasmids useful for expression of fusion 
proteins. Purification of hybrid proteins containing the C-LytA fragment at its amino 
terminus has been described {Biotechnology: 10, (1992) page 795-798}. It is possible 
to use the repeat portion of the LytA molecule found in the C terminal end starting at 
residue 178, for example residues 188 - 305. 

The present invention also includes variants of the aforementioned polypeptides, that is 
polypeptides that vary from the referents by conservative amino acid substitutions, 
whereby a residue is substituted by another with like characteristics. Typical such 
substitutions are among Ala, Val, Leu and He; among Ser and Thr; among the acidic 
residues Asp and Glu; among Asn and Gin; and among the basic residues Lys and Arg; or 
aromatic residues Phe and Tyr. 

Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
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combination of these methods. Means for preparing such polypeptides are well 
understood in the art. 

It is most preferred that a polypeptide of the invention is derived from Moraxella 
catarrhalis, however, it may preferably be obtained from other organisms of the same 
taxonomic genus. A polypeptide of the invention may also be obtained, for example, from 
organisms of the same taxonomic family or order. 

Polynucleotides 

It is an object of the invention to provide polynucleotides that encode BASB027 
polypeptides, particularly polynucleotides that encode the polypeptide herein designated 
BASB027. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding BASB027 polypeptides comprising a sequence set out in SEQ ID NO:l or 
3 which includes a full length gene, or a variant thereof. 

The BASB027 polynucleotides provided in SEQ ID NO:l or 3 are the BASB027 
polynucleotides from Moraxella catarrhalis strain Mc2931 (ATCC 43617). 

As a further aspect of the invention there are provided isolated nucleic acid molecules 
encoding and/or expressing BASB027 polypeptides and polynucleotides, particularly 
Moraxella catarrhalis BASB027 polypeptides and polynucleotides, including, for 
example, unprocessed RNAs, ribo2yme RNAs, mRNAs, cDNAs, genomic DNAs, B- 
and Z-DNAs. Further embodiments of the invention include biologically, 
diagnostically, prophylactically, clinically or therapeutically useful polynucleotides and 
polypeptides, and variants thereof, and compositions comprising the same. 
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Another aspect of the invention relates to isolated polynucleotides, including at least one full 
length gene, that encodes a BASB027 polypeptide having a deduced amino acid sequence of 
SEQ ID NO:2 or 4 and polynucleotides closely related thereto and variants thereof. 

In another particularly preferred embodiment of the invention there is a BASB027 
polypeptide from Moraxella catarrhalis comprising or consisting of an amino acid 
sequence of SEQ ID NO:2 or 4 or a variant thereof. 

Using the information provided herein, such as a polynucleotide sequence set out in SEQ ID 
NO:l or 3, a polynucleotide of the invention encoding BASB027 polypeptide may be 
obtained using standard cloning and screening methods, such as those for cloning and 
sequencing chromosomal DNA fragments from bacteria using Moraxella catarrhalis Catiin 
cells as starting material, followed by obtaining a full length clone. For example, to obtain a 
polynucleotide sequence of the invention, such as a polynucleotide sequence given in 
SEQ ID NO:l or 3, typically a library of clones of chromosomal DNA of Moraxella 
catarrhalis Catiin in Exoli or some other suitable host is probed with a radiolabeled 
oligonucleotide, preferably a 1 7-mer or longer, derived from a partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using stringent 
hybridization conditions. By sequencing the individual clones thus identified by 
hybridization with sequencing primers designed from the original polypeptide or 
polynucleotide sequence it is then possible to extend the polynucleotide sequence in both 
directions to determine a full length gene sequence. Conveniently, such sequencing is 
performed, for example, using denatured double stranded DNA prepared from a plasmid 
clone. Suitable techniques are described by Maniatis, T., Fritsch, E.F. and Sambrook et 
aL, MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (1989). (see in particular Screening By 
Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13,70), 
Direct genomic DNA sequencing may also be performed to obtain a full length gene 
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sequence. Illustrative of the invention, each polynucleotide set out in SEQ ID NO:l or 3 
was discovered in a DNA library derived from Moraxella catarr halts. 

Moreover, each DNA sequence set out in SEQ ID NO: 1 or 3 contains an open reading frame 
encoding a protein having about the number of amino acid residues set forth in SEQ ID 
NO:2 or 4 with a deduced molecular weight that can be calculated using amino acid residue 
molecular weight values well known to those skilled in the art. 

The polynucleotide of SEQ ID NO: 1 , between the start codon at nucleotide number I and 
the stop codon which begins at nucleotide number 2440 of SEQ ID NO; 1 . encodes the 
polypeptide of SEQ ID NO:2. 

The polynucleotide of SEQ ID NO:3, between the start codon at nucleotide number 1 and 
the stop codon which begins at nucleotide number 2440 of SEQ ID NO:3. encodes the 
polypeptide of SEQ ID NO:4. 

In a further aspect, the present invention provides for an isolated polynucleotide 
comprising or consisting of: 

(a) a polynucleotide sequence which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity to SEQ ID NO: 1 or 3 over the entire length of SEQ ID NO: 1 or 3 
respectively; or 

(b) a polynucleotide sequence encoding a polypeptide which has at least 85% identity, 
preferably at least 90% identity, more preferably at least 95% identity, even more 
preferably at least 97-99% or 100% exact, to the amino acid sequence of SEQ ID NO:2 
or 4, over the entire length of SEQ ID NO:2 or 4 respectively. 

A polynucleotide encoding a polypeptide of the present invention, including homologs and 
orthologs from species other than Moraxella catarrhalis, may be obtained by a process 
which comprises the steps of screening an appropriate library under stringent hybridization 
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conditions (for example, using a temperature in the range of 45 - 65°C and an SDS 
concentration from 0.1 - 1%) with a labeled or detectable probe consisting of or comprising 
the sequence of SEQ ID NO: 1 or 3 or a fragment thereof: and isolating a full-length gene 
and/or genomic clones containing said polynucleotide sequence. 

The invention provides a polynucleotide sequence identical over its entire length to a coding 
sequence (open reading frame) in SEQ ID NO: 1 or 3. Also provided by the invention is a 
coding sequence for a mature polypeptide or a fragment thereof, by itself as well as a coding 
sequence for a mature polypeptide or a fragment in reading frame with another coding 
sequence, such as a sequence encoding a leader or secretory sequence, a pre-, or pro- or 
prepro-protein sequence. The polynucleotide of the invention may also contain at least one 
non-coding sequence, including for example, but not limited to at least one non-coding 5' 
and 3 ' sequence, such as the transcribed but non-translated sequences, termination signals 
(such as rho-dependent and rho-independent termination signals), ribosome binding sites, 
Kozak sequences, sequences that stabilize mRNA, introns, and polyadenylation signals. 
The polynucleotide sequence may also comprise additional coding sequence encoding 
additional amino acids. For example, a marker sequence that facilitates purification of the 
fused polypeptide can be encoded. In certain embodiments of the invention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and 
described in Gentz et aU Proa Natl. Acad ScL, USA 86: 821-824 (1989), or an HA peptide 
tag (Wilson et al n Cell 37; 767 (1984), both of which may be useful in purifying 
polypeptide sequence fused to them. Polynucleotides of the invention also include, but are 
not limited to, polynucleotides comprising a structural gene and its naturally associated 
sequences that control gene expression. 

The nucleotide sequence encoding BASB027 polypeptide of SEQ ID NO:2 or 4 may be 
identical to the polypeptide encoding sequence contained in nucleotides 1 to 2439 of SEQ 
ID NO: 1 or 3 respectively. Alternatively it may be a sequence, which as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID 
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NO:2 or 4. 

The term "polynucleotide encoding a polypeptide" as used herein encompasses 
polynucleotides that include a sequence encoding a polypeptide of the invention, 
particularly a bacterial polypeptide and more particularly a polypeptide of the Moraxella 
catarrhalis B ASB027 having an amino acid sequence set out in SEQ ID NO:2 or 4. The 
term also encompasses polynucleotides that include a single continuous region or 
discontinuous regions encoding the polypeptide (for example, polynucleotides interrupted 
by integrated phage, an integrated insertion sequence, an integrated vector sequence, an 
integrated transposon sequence, or due to RNA editing or genomic DNA reorganization) 
together with additional regions, that also may contain coding and/or non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that encode 
variants of a polypeptide having a deduced amino acid sequence of SEQ ID NO:2 or 4. 
Fragments of polynucleotides of the invention may be used, for example, to synthesize full- 
length polynucleotides of the invention. 

Further particularly preferred embodiments are polynucleotides encoding BASB027 
variants, that have the amino acid sequence of BASB027 polypeptide of SEQ ID NO:2 or 4 
in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, 
modified, deleted and/or added, in any combination. Especially preferred among these are 
silent substitutions, additions and deletions, that do not alter the properties and activities of 
BASB027 polypeptide. 

Further preferred embodiments of the invention are polynucleotides that are at least 85% 
identical over their entire length to a polynucleotide encoding BASB027 polypeptide having 
an amino acid sequence set out in SEQ ID NO:2 or 4, and polynucleotides that are 
complementary to such polynucleotides. Alternatively, most highly preferred are 
polynucleotides that comprise a region that is at least 90% identical over its entire length to 
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a polynucleotide encoding BASB027 polypeptide and polynucleotides complementary 
thereto. In this regard, polynucleotides at least 95% identical over their entire length to the 
same are particularly preferred. Furthermore, those with at least 97% are highly preferred 
among those with at least 95%. and among these those with at least 98% and at least 99% 
are particularly highly preferred, with at least 99% being the more preferred. 

Preferred embodiments are polynucleotides encoding polypeptides that retain substantially 
the same biological function or activity as the mature polypeptide encoded by a DNA of 
SEQ IDNO:l or 3. 

In accordance with certain preferred embodiments of this invention there are provided 
polynucleotides that hybridize, particularly under stringent conditions, to BASB027 
polynucleotide sequences, such as those polynucleotides in SEQ ID NO:l or 3. 

The invention further relates to polynucleotides that hybridize to the polynucleotide 
sequences provided herein. In this regard, the invention especially relates to polynucleotides 
that hybridize under stringent conditions to the polynucleotides described herein. As herein 
used, the terms "stringent conditions" and "stringent hybridization conditions" mean 
hybridization occurring only if there is at least 95% and preferably at least 97% identity 
between the sequences. A specific example of stringent hybridization conditions is 
overnight incubation at 42°C in a solution comprising: 50% formamide, 5x SSC (150mM 
NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x Denhardt's 
solution, 10% dextran sulfate, and 20 micrograms/ml of denatured, sheared salmon sperm 
DNA, followed by washing the hybridization support in 0.1 x SSC at about 65°C. 
Hybridization and wash conditions are well known and exemplified in Sambrook, et aL, 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., 
(1989), particularly Chapter 1 1 therein. Solution hybridization may also be used with the 
polynucleotide sequences provided by the invention. 
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The invention also provides a polynucleotide consisting of or comprising a polynucleotide 
sequence obtained by screening an appropriate library containing the complete gene for a 
polynucleotide sequence set forth in SEQ ID NO:l or 3 under stringent hybridization 
conditions with a probe having the sequence of said polynucleotide sequence set forth in 
SEQ ID NO:l or 3 or a fragment thereof; and isolating said polynucleotide sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers fully described elsewhere herein. 

As discussed elsewhere herein regarding polynucleotide assays of the invention, for 
instance, the polynucleotides of the invention, may be used as a hybridization probe for 
RN A, cDN A and genomic DNA to isolate full-length cDNAs and genomic clones encoding 
BASB027 and to isolate cDNA and genomic clones of other genes that have a high identity, 
particularly high sequence identity, to the BASB027 gene. Such probes generally will 
comprise at least 15 nucleotide residues or base pairs. Preferably, such probes will have at 
least 30 nucleotide residues or base pairs and may have at least 50 nucleotide residues or 
base pairs. Particularly preferred probes will have at least 20 nucleotide residues or base 
pairs and will have less than 30 nucleotide residues or base pairs. 

A coding region of a B ASB027 gene may be isolated by screening using a DNA sequence 
provided in SEQ ID NO:l or 3 to synthesize an oligonucleotide probe. A labeled 
oligonucleotide having a sequence complementary to that of a gene of the invention is then 
used to screen a library of cDNA, genomic DNA or mRNA to determine which members of 
the library the probe hybridizes to. 

There are several methods available and well known to those skilled in the art to obtain 
full-length DNAs, or extend short DNAs, for example those based on the method of Rapid 
Amplification of cDNA ends (RACE) (see, for example, Frohman, et aL, PNAS USA 85: 
8998-9002, 1988). Recent modifications of the technique, exemplified by the Marathon™ 
technology (Clontech Laboratories Inc.) for example, have significantly simplified the 



15 



WO 99/63093 



PCT/EP99/03822 



search for longer cDNAs. In the Marathon™ technology, cDNAs have been prepared 
from mRNA extracted from a chosen tissue and an 'adaptor' sequence ligated onto each 
end. Nucleic acid amplification (PCR) is then carried out to amplify the "missing" 5' end 
of the DNA using a combination of gene specific and adaptor specific oligonucleotide 
primers. The PCR reaction is then repeated using "nested 11 primers, that is, primers 
designed to anneal within the amplified product (typically an adaptor specific primer that 
anneals further 3' in the adaptor sequence and a gene specific primer that anneals further 5' 
in the selected gene sequence). The products of this reaction can then be analyzed by 
DNA sequencing and a full-length DNA constructed either by joining the product directly 
to the existing DNA to give a complete sequence, or carrying out a separate full-length 
PCR using the new sequence information for the design of the 5' primer. 

The polynucleotides and polypeptides of the invention may be employed, for example, as 
research reagents and materials for discovery of treatments of and diagnostics for diseases, 
particularly human diseases, as further discussed herein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived from a sequence of 
SEQ ID NOS:l or 3 may be used in the processes herein as described, but preferably for 
PCR, to determine whether or not the polynucleotides identified herein in whole or in part 
are transcribed in bacteria in infected tissue. It is recognized that such sequences will also 
have utility in diagnosis of the stage of infection and type of infection the pathogen has 
attained. 

The invention also provides polynucleotides that encode a polypeptide that is the mature 
protein plus additional amino or carboxyl-terminal amino acids, or amino acids interior to 
the mature polypeptide (when the mature form has more than one polypeptide chain, for 
instance). Such sequences may play a role in processing of a protein from precursor to a 
mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
facilitate manipulation of a protein for assay or production, among other things. As 
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generally is the case in vivo, the additional amino acids may be processed away from the 
mature protein by cellular enzymes. 

For each and every polynucleotide of the invention there is provided a polynucleotide 
complementary to it. It is preferred that these complementary polynucleotides are fully 
complementary to each polynucleotide with which they are complementary. 

A precursor protein, having a mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

In addition to the standard A, G, C, T/U representations for nucleotides, the term "N" may 
also be used in describing certain polynucleotides of the invention. "N" means that any of 
the four DNA or RNA nucleotides may appear at such a designated position in the DNA 
or RNA sequence, except it is preferred that N is not a nucleic acid that when taken in 
combination with adjacent nucleotide positions, when read in the correct reading frame, 
would have the effect of generating a premature termination codon in such reading frame. 

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein 
plus a leader sequence (which may be referred to as a preprotein), a precursor of a mature 
protein having one or more prosequences that are not the leader sequences of a preprotein, 
or a preproproteim which is a precursor to a proprotein, having a leader sequence and one or 
more prosequences, which generally are removed during processing steps that produce 
active and mature forms of the polypeptide. 

In accordance with an aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. 
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The use of a polynucleotide of the invention in genetic immunization will preferably 
employ a suitable delivery method such as direct injection of piasmid DNA into muscles 
(Wolff etaL Hum Mol Genet (1992) 1 : 363, Manthorpe et al, Hum. Gene Ther. (1983)4: 
419), delivery of DNA complexed with specific protein carriers (Wu et al.^J Biol Chem. 
(1989) 264: 16985), coprecipitation of DNA with calcium phosphate (Benvenisty & 
Reshef. PNAS USA, (1986) 83: 9551), encapsulation of DNA in various forms of 
liposomes (Kaneda et al, Science (1989) 243: 375). particle bombardment (Tang et al, 
Nature (1992) 356:152, Eisenbraun et al, DNA Cell Biol (1993) 12: 791) and in vivo 
infection using cloned retroviral vectors (Seeger et al, PNAS USA (1984) 81 : 5849). 

Vectors, Host Cells, Expression Systems 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of 
the invention, host cells that are genetically engineered with vectors of the invention and the 
production of polypeptides of the invention by recombinant techniques. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the invention. 

Recombinant polypeptides of the present invention may be prepared by processes well 
known in those skilled in the art from genetically engineered host cells comprising 
expression systems. Accordingly, in a further aspect, the present invention relates to 
expression systems that comprise a polynucleotide or polynucleotides of the present 
invention, to host cells which are genetically engineered with such expression systems, and 
to the production of polypeptides of the invention by recombinant techniques. 

For recombinant production of the polypeptides of the invention, host cells can be 
genetically engineered to incorporate expression systems or portions thereof or 
polynucleotides of the invention. Introduction of a polynucleotide into the host cell can be 



18 



WO 99/63093 



PCT/EP99/03822 



effected by methods described in many standard laboratory manuals, such as Davis, et al, 
BASIC METHODS IN MOLECULAR BIOLOGY, (1986) and Sambrook, etal, 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic 
lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic 
introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as cells of 
streptococci, staphylococci, enterococci, E. coli 7 streptomyces, cyanobacteria. Bacillus 
subtilis, Neisseria meningitidis and Moraxella catarrhalis: fungal cells, such as cells of a 
yeast Kluveromyces. Saccharomyces, a basidiomycete, Candida albicans and Aspergillus] 
insect cells such as cells of Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, 
COS, HeLa, C127, 3T3, BHK, 293, CV-1 and Bowes melanoma cells; and plant cells, such 
as cells of a gymnosperm or angiosperm. 

A great variety of expression systems can be used to produce the polypeptides of the 
invention. Such vectors include, among others, chromosomal-, episomal- and virus-derived 
vectors, for example, vectors derived from bacterial plasmids, from bacteriophage, from 
transposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses, picornaviruses, retroviruses, 
and alphaviruses and vectors derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression system constructs may contain control regions that regulate as well as engender 
expression. Generally, any system or vector suitable to maintain, propagate or express 
polynucleotides and/or to express a polypeptide in a host may be used for expression in this 
regard. The appropriate DN A sequence may be inserted into the expression system by any 
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of a variety of well-known and routine techniques, such as, for example, those set forth in 
Sambrook et aL MOLECULAR CLONING, A LABORATORY MANUAL, {supra). 

In recombinant expression systems in eukaryotes. for secretion of a translated protein into 
the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular 
environment, appropriate secretion signals may be incorporated into the expressed 
polypeptide. These signals may be endogenous to the polypeptide or they may be 
heterologous signals. 

Polypeptides of the present invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. Most preferably, ion metal 
affinity chromatography (IMAC) is employed for purification. Well known techniques 
for refolding proteins may be employed to regenerate active conformation when the 
polypeptide is denatured during intracellular synthesis, isolation and or purification. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
vivo expression of the antigen and induction of immune responses. Viruses and bacteria 
used for this purpose are for instance: poxviruses (e.g; vaccinia, fowlpox, canarypox), 
alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelian Equine Encephalitis 
Virus), adenoviruses, adeno-associated virus, picornaviruses (poliovirus, rhinovirus), 
herpesviruses (varicella zoster virus, etc), Listeria, Salmonella , Shigella, BCG. These 
viruses and bacteria can be virulent, or attenuated in various ways in order to obtain live 
vaccines. Such live vaccines also form part of the invention. 
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Diagnostic, Prognostic, Serotyping and Mutation Assays 

This invention is also related to the use of BASB027 polynucleotides and polypeptides of 
the invention for use as diagnostic reagents. Detection of BASB027 polynucleotides and/or 
polypeptides in a eukaryote, particularly a mammal, and especially a human, will provide a 
diagnostic method for diagnosis of disease, staging of disease or response of an infectious 
organism to drugs. Eukaryotes. particularly mammals, and especially humans, particularly 
those infected or suspected to be infected with an organism comprising the BASB027 gene 
or protein, may be detected at the nucleic acid or amino acid level by a variety of well 
known techniques as well as by methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be obtained 
from a putatively infected and/or infected individual's bodily materials. Polynucleotides 
from any of these sources, particularly DNA or RNA, may be used directly for detection or 
may be amplified enzymatically by using PCR or any other amplification technique prior to 
analysis. RNA, particularly mRNA, cDNA and genomic DNA may also be used in the 
same ways. Using amplification, characterization of the species and strain of infectious or 
resident organism present in an individual, may be made by an analysis of the genotype of a 
selected polynucleotide of the organism. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to a genotype of a reference sequence 
selected from a related organism, preferably a different species of the same genus or a 
different strain of the same species. Point mutations can be identified by hybridizing 
amplified DNA to labeled BASB027 polynucleotide sequences. Perfectly or significantly 
matched sequences can be distinguished from imperfectly or more significantly mismatched 
duplexes by DNase or RNase digestion, for DNA or RNA respectively, or by detecting 
differences in melting temperatures or renaturation kinetics. Polynucleotide sequence 
differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequence. This may be carried 
out with or without denaturing agents. Polynucleotide differences may also be detected by 
direct DNA or RNA sequencing. See, for example, Myers et aL Science, 230: 1242 (1985). 
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Sequence changes at specific locations also may be revealed by nuclease protection assays, 
such as RNase, VI and SI protection assay or a chemical cleavage method. See, for 
example. Cotton etal, Proc. Nad. Acad ScL, USA, 85: 4397-4401 (1985). 

In another embodiment, an array of oligonucleotides probes comprising BASB027 
nucleotide sequence or fragments thereof can be constructed to conduct efficient screening 
of, for example, genetic mutations, serotype, taxonomic classification or identification. 
Array technology methods are well known and have general applicability and can be used to 
address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see, for example, Chee et aL, Science, 274: 610 (1996)). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO:l or 3, or a fragment thereof ; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO:2 or 
4 or a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQ ID NO:2 or 4. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. Such a kit will be of use in diagnosing a disease or susceptibility to a 
Disease, among others. 

This invention also relates to the use of polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of a polynucleotide of the invention, 
preferably SEQ ID NO:l or 3, which is associated with a disease or pathogenicity will 
provide a diagnostic tool that can add to, or define, a diagnosis of a disease, a prognosis of a 
course of disease, a determination of a stage of disease, or a susceptibility to a disease. 
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which results from under-expressioru over-expression or altered expression of the 
polynucleotide. Organisms, particularly infectious organisms, carrying mutations in such 
polynucleotide may be detected at the polynucleotide level by a variety of techniques, such 
as those described elsewhere herein. 

Ceils from an organism carrying mutations or polymorphisms (allelic variations) in a 
polynucleotide and/or polypeptide of the invention may also be detected at the 
polynucleotide or polypeptide level by a variety of techniques, to allow for serotyping, for 
example. For example, RT-PCR can be used to detect mutations in the RNA. It is 
particularly preferred to use RT-PCR in conjunction with automated detection systems, such 
as. for example, GeneScan. RNA, cDNA or genomic DNA may also be used for the same 
purpose. PCR. As an example, PCR primers complementary to a polynucleotide encoding 
BASB027 polypeptide can be used to identify and analyze mutations. 

The invention further provides primers with 1, 2, 3 or 4 nucleotides removed from the 5 1 
and/or the 3 r end. These primers may be used for, among other things, amplifying 
BASB027 DNA and/or RNA isolated from a sample derived from an individual, such as a 
bodily material. The primers may be used to amplify a polynucleotide isolated from an 
infected individual, such that the polynucleotide may then be subject to various techniques 
for elucidation of the polynucleotide sequence. In this way, mutations in the polynucleotide 
sequence may be detected and used to diagnose and/or prognose the infection or its stage or 
course, or to serotype and/or classify the infectious agent. 

The invention further provides a process for diagnosing, disease, preferably bacterial 
infections, more preferably infections caused by Moraxella catarrhalis, comprising 
determining from a sample derived from an individual, such as a bodily material, an 
increased level of expression of polynucleotide having a sequence of SEQ ID NO: 1 or 3. 
Increased or decreased expression of a BASB027 polynucleotide can be measured using 
any on of the methods well known in the art for the quantitation of polynucleotides, such 
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as, for example, amplification, PCR, RT-PCR, RNase protection. Northern blotting, 
spectrometry and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of BASB027 polypeptide compared to normal control tissue samples may be 
used to detect the presence of an infection, for example. Assay techniques that can be used 
to determine levels of a BASB027 polypeptide, in a sample derived from a host, such as a 
bodily material, are well-known to those of skill in the art. Such assay methods include 
radioimmunoassays, competitive-binding assays. Western Biot analysis, antibody sandwich 
assays, antibody detection and ELISA assays. 

The polynucleotides of the invention may be used as components of polynucleotide 
arrays, preferably high density arrays or grids. These high density arrays are 
particularly useful for diagnostic and prognostic purposes. For example, a set of spots 
each comprising a different gene, and further comprising a polynucleotide or 
polynucleotides of the invention, may be used for probing, such as using hybridization 
or nucleic acid amplification, using a probes obtained or derived from a bodily sample, 
to determine the presence of a particular polynucleotide sequence or related sequence in 
an individual. Such a presence may indicate the presence of a pathogen, particularly 
Moraxella catarrhalis, and may be useful in diagnosing and/or prognosing disease or a 
course of disease. A grid comprising a number of variants of the polynucleotide 
sequence of SEQ ID NO:l or 3 are preferred. Also preferred is a comprising a number 
of variants of a polynucleotide sequence encoding the polypeptide sequence of SEQ ID 
NO:2 or 4. 

Antibodies 

The polypeptides and polynucleotides of the invention or variants thereof, or cells 
expressing the same can be used as immunogens to produce antibodies immunospecific for 
such polypeptides or polynucleotides respectively. 
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In certain preferred embodiments of the invention there are provided antibodies against 
BASB027 polypeptides or polynucleotides. 

Antibodies generated against the polypeptides or polynucleotides of the invention can be 
obtained by administering the polypeptides and/or polynucleotides of the invention, or 
epitope-bearing fragments of either or both, analogues of either or both, or cells expressing 
either or both, to an animal, preferably a nonhuman. using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art that provides 
antibodies produced by continuous cell line cultures can be used. Examples include various 
techniques, such as those in Kohler, G. and Milstein, C, Nature 256: 495-497 (1975); 
Kozbor et al . Immunology Today 4: 72 (1983); Cole et ai, pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985). 

Techniques for the production of single chain antibodies (U.S. Patent No, 4,946,778) can be 
adapted to produce single chain antibodies to polypeptides or polynucleotides of this 
invention. Also, transgenic mice, or other organisms or animals, such as other mammals, 
may be used to express humanized antibodies immunospecific to the polypeptides or 
polynucleotides of the invention. 

Alternatively, phage display technology may be utilized to select antibody genes with 
binding activities towards a polypeptide of the invention either from repertoires of PCR 
amplified v-genes of lymphocytes from humans screened for possessing anti-BASB027 or 
from naive libraries (McCafferty, et al y (1990), Nature 348, 552-554; Marks, et ah, 
(1992) Biotechnology 10, 779-783), The affinity of these antibodies can also be improved 
by, for example, chain shuffling (Clackson et al, (1991) Nature 352: 628). 
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The above-described antibodies may be employed to isolate or to identify clones expressing 
the polypeptides or polynucleotides of the invention to purify the polypeptides or 
polynucleotides by. for example, affinity chromatography. 

Thus, among others, antibodies against BASB027-polypeptide or BASB027-polynucleotide 
may be employed to treat infections, particularly bacterial infections. 

Polypeptide variants include antigenically, epitopically or immunologically equivalent 
variants form a particular aspect of this invention. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the 
individual. For example, if the individual is human the antibody may most preferably be 
"humanized," where the complimentarity determining region or regions of the hybridoma- 
derived antibody has been transplanted into a human monoclonal antibody, for example as 
described in Jones et al (1986), Nature 321, 522-525 or Tempest et ai y (1991) 
Biotechnology 9, 266-273. 

Antagonists and Agonists - Assays and Molecules 

Polypeptides and polynucleotides of the invention may also be used to assess the binding of 
small molecule substrates and ligands in, for example, cells, cell-free preparations, chemical 
libraries, and natural product mixtures. These substrates and ligands may be natural 
substrates and ligands or may be structural or functional mimetics. See, e.g., Coligan et al, 
Current Protocols in Immunology 1(2); Chapter 5 (1991). 

The screening methods may simply measure the binding of a candidate compound to the 
polypeptide or polynucleotide, or to cells or membranes bearing the polypeptide or 
polynucleotide, or a fusion protein of the polypeptide by means of a label directly or 
indirectly associated with the candidate compound. Alternatively, the screening method 
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may involve competition with a labeled competitor. Further, these screening methods 
may test whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide or polynucleotide, using detection systems appropriate to the 
cells comprising the polypeptide or polynucleotide. Inhibitors of activation are generally- 
assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. Constitutively active polypeptide 
and/or constitutively expressed polypeptides and polynucleotides may be employed in 
screening methods for inverse agonists or inhibitors, in the absence of an agonist or 
inhibitor, by testing whether the candidate compound results in inhibition of activation of 
the polypeptide or polynucleotide, as the case may be. Further, the screening methods 
may simply comprise the steps of mixing a candidate compound with a solution 
containing a polypeptide or polynucleotide of the present invention, to form a mixture, 
measuring BASB027 polypeptide and/or polynucleotide activity in the mixture, and 
comparing the BASB027 polypeptide and/or polynucleotide activity of the mixture to a 
standard. Fusion proteins, such as those made from Fc portion and BASB027 
polypeptide, as hereinbefore described, can also be used for high-throughput screening 
assays to identify antagonists of the polypeptide of the present invention, as well as of 
phylogenetically and and/or functionally related polypeptides (see D. Bennett et al, J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et a/., J Biol Chem, 270(16):9459-9471 
(1995)). 

The polynucleotides, polypeptides and antibodies that bind to and/or interact with a 
polypeptide of the present invention may also be used to configure screening methods for 
detecting the effect of added compounds on the production of mRNA and/or polypeptide 
in cells. For example, an ELISA assay may be constructed for measuring secreted or cell 
associated levels of polypeptide using monoclonal and polyclonal antibodies by standard 
methods known in the art. This can be used to discover agents which may inhibit or 
enhance the production of polypeptide (also called antagonist or agonist, respectively) 
from suitably manipulated cells or tissues. 
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The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of BASB027 polypeptides or 
polynucleotides, particularly those compounds that are bacteriostatic and/or bactericidal. 
The method of screening may involve high-throughput techniques. For example, to screen 
for agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a 
membrane, cell envelope or cell wall or a preparation of any thereof, comprising BASB027 
polypeptide and a labeled substrate or ligand of such polypeptide is incubated in the absence 
or the presence of a candidate molecule that may be a BASB027 agonist or antagonist. The 
ability of the candidate molecule to agonize or antagonize the BASB027 polypeptide is 
reflected in decreased binding of the labeled ligand or decreased production of product from 
such substrate. Molecules that bind gratuitously, i.e., without inducing the effects of 
BASB027 polypeptide are most likely to be good antagonists. Molecules that bind well 
and, as the case may be, increase the rate of product production from substrate, increase 
signal transduction, or increase chemical channel activity are agonists. Detection of the rate 
or level of, as the case may be, production of product from substrate, signal transduction, or 
chemical channel activity may be enhanced by using a reporter system. Reporter systems 
that may be useful in this regard include but are not limited to colorimetric, labeled substrate 
converted into product, a reporter gene that is responsive to changes in BASB027 
polynucleotide or polypeptide activity, and binding assays known in the art. 

Another example of an assay for BASB027 agonists is a competitive assay that combines 
BASB027 and a potential agonist with BASB027-binding molecules, recombinant 
BASB027 binding molecules, natural substrates or ligands, or substrate or ligand mimetics, 
under appropriate conditions for a competitive inhibition assay. BASB027 can be labeled, 
such as by radioactivity or a colorimetric compound, such that the number of BASB027 
molecules bound to a binding molecule or converted to product can be determined 
accurately to assess the effectiveness of the potential antagonist. 
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Potential antagonists include, among others, small organic molecules, peptides, polypeptides 
and antibodies that bind to a polynucleotide and/or polypeptide of the invention and thereby 
inhibit or extinguish its activity or expression. Potential antagonists also may be small 
organic molecules, a peptide, a polypeptide such as a closely related protein or antibody that 
binds the same sites on a binding molecule, such as a binding molecule, without inducing 
BASB027-induced activities, thereby preventing the action or expression of BASB027 
polypeptides and/or polynucleotides by excluding BASB027 polypeptides and/or 
polynucleotides from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding site of 
the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION 
CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred 
potential antagonists include compounds related to and variants of BASB027. 

In a further aspect, the present invention relates to genetically engineered soluble fusion 
proteins comprising a polypeptide of the present invention, or a fragment thereof, and 
various portions of the constant regions of heavy or light chains of immunoglobulins of 
various subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant 
part of the heavy chain of human IgG, particularly IgGl , where fusion takes place at the 
hinge region. In a particular embodiment, the Fc part can be removed simply by 
incorporation of a cleavage sequence which can be cleaved with blood clotting factor Xa. 
Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
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fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. W094/29458 and W094/22914. 

Each of the polynucleotide sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the polynucleotide 
sequences encoding the amino terminal regions of the encoded protein or Shine-Delgarno 
or other translation facilitating sequences of the respective mRNA can be used to 
construct antisense sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide, agonist or 
antagonist of the invention to interfere with the initial physical interaction between a 
pathogen or pathogens and a eukaryotic, preferably mammalian, host responsible for 
sequelae of infection. In particular, the molecules of the invention may be used: in the 
prevention of adhesion of bacteria, in particular gram positive and/or gram negative 
bacteria, to eukaryotic, preferably mammalian, extracellular matrix proteins on in- 
dwelling devices or to extracellular matrix proteins in wounds; to block bacterial adhesion 
between eukaryotic, preferably mammalian, extracellular matrix proteins and bacterial 
BASB027 proteins that mediate tissue damage and/or; to block the normal progression of 
pathogenesis in infections initiated other than by the implantation of in-dwelling devices 
or by other surgical techniques. 

In accordance with yet another aspect of the invention, there are provided BASB027 
agonists and antagonists, preferably bacteristatic or bactericidal agonists and antagonists. 

The antagonists and agonists of the invention may be employed, for instance, to prevent, 
inhibit and/or treat diseases. 
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In a further aspect, the present invention relates to mimotopes of the polypeptide of the 
invention. A mimotope is a peptide sequence, sufficiently similar to the native peptide 
(sequentially or structurally), which is capable of being recognised by antibodies which 
recognise the native peptide: or is capable of raising antibodies which recognise the 
native peptide when coupled to a suitable carrier. 

Peptide mimotopes may be designed for a particular purpose by addition, deletion or 
substitution of elected amino acids. Thus, the peptides may be modified for the purposes 
of ease of conjugation to a protein carrier. For example, it may be desirable for some 
chemical conjugation methods to include a terminal cysteine. In addition it may be 
desirable for peptides conjugated to a protein carrier to include a hydrophobic terminus 
distal from the conjugated terminus of the peptide, such that the free unconjugated end 
of the peptide remains associated with the surface of the carrier protein. Thereby 
presenting the peptide in a conformation which most closely resembles that of the 
peptide as found in the context of the whole native molecule. For example, the peptides 
may be altered to have an N-terminal cysteine and a C-terminal hydrophobic amidated 
tail. Alternatively, the addition or substitution of a D-stereoisomer form of one or more 
of the amino acids may be performed to create a beneficial derivative, for example to 
enhance stability of the peptide. 

Alternatively, peptide mimotopes may be identified using antibodies which are capable 
themselves of binding to the polypeptides of the present invention using techniques such 
as phage display technology (EP 0 552 267 Bl). This technique, generates a large number 
of peptide sequences which mimic the structure of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native polypeptide. 

Vaccines 
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Another aspect of the invention relates to a method for inducing an immunological 
response in an individual, particularly a mammal, preferably humans, which comprises 
inoculating the individual with BASB027 polynucleotide and/or polypeptide, or a 
fragment or variant thereof, adequate to produce antibody and/ or T cell immune response 
to protect said individual from infection, particularly bacterial infection and most 
particularly Moraxella catarrhalis infection. Also provided are methods whereby such 
immunological response slows bacterial replication. Yet another aspect of the invention 
relates to a method of inducing immunological response in an individual which comprises 
delivering to such individual a nucleic acid vector, sequence or ribozyme to direct 
expression of BASB027 polynucleotide and/or polypeptide, or a fragment or a variant 
thereof, for expressing BASB027 polynucleotide and/or polypeptide, or a fragment or a 
variant thereof in vivo in order to induce an immunological response, such as, to produce 
antibody and/ or T cell immune response, including, for example, cytokine-producing T 
cells or cytotoxic T cells, to protect said individual, preferably a human, from disease, 
whether that disease is already established within the individual or not. One example of 
administering the gene is by accelerating it into the desired cells as a coating on particles 
or otherwise. Such nucleic acid vector may comprise DNA, RNA, a ribozyme, a modified 
nucleic acid, a DNA/RNA hybrid, a DNA-protein complex or an RNA-protein complex. 

A further aspect of the invention relates to an immunological composition that when 
introduced into an individual, preferably a human, capable of having induced within it an 
immunological response, induces an immunological response in such individual to a 
BASB027 polynucleotide and/or polypeptide encoded therefrom, wherein the composition 
comprises a recombinant BASB027 polynucleotide and/or polypeptide encoded therefrom 
and/or comprises DNA and/or RNA which encodes and expresses an antigen of said 
BASB027 polynucleotide, polypeptide encoded therefrom, or other polypeptide of the 
invention. The immunological response may be used therapeutically or prophylactically 
and may take the form of antibody immunity and/or cellular immunity, such as cellular 
immunity arising from CTL or CD4+ T cells. 
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A BASB027 polypeptide or a fragment thereof may be fused with co-protein or chemical 
moiety which may or may not by itself produce antibodies, but which is capable of 
stabilizing the first protein and producing a fused or modified protein which will have 
antigenic and/or immunogenic properties, and preferably protective properties. Thus 
fused recombinant protein, preferably further comprises an antigenic co-protein, such as 
lipoprotein D from Haemophilus influenzae, Glutathione-S-transferase (GST) or beta- 
gal actosidase. or any other relatively large co-protein which solubilizes the protein and 
facilitates production and purification thereof. Moreover, the co-protein may act as an 
adjuvant in the sense of providing a generalized stimulation of the immune system of the 
organism receiving the protein. The co-protein may be attached to either the amino- or 
carboxy-terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and 
methods comprising the polypeptides and/or polynucleotides of the invention and 
immunostimulatory DNA sequences, such as those described in Sato, Y. et ah Science 
273:352 (1996). 

Also, provided by this invention are methods using the described polynucleotide or 
particular fragments thereof, which have been shown to encode non-variable regions of 
bacterial cell surface proteins, in polynucleotide constructs used in such genetic 
immunization experiments in animal models of infection with Moraxella catarrhalis. 
Such experiments will be particularly useful for identifying protein epitopes able to 
provoke a prophylactic or therapeutic immune response. It is believed that this approach 
will allow for the subsequent preparation of monoclonal antibodies of particular value, 
derived from the requisite organ of the animal successfully resisting or clearing infection, 
for the development of prophylactic agents or therapeutic treatments of bacterial infection, 
particularly Moraxella catarrhalis infection, in mammals, particularly humans. 
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The invention also includes a vaccine formulation which comprises an immunogenic 
recombinant polypeptide and/or polynucleotide of the invention together with a suitable 
carrier, such as a pharmaceutical^ acceptable carrier. Since the polypeptides and 
polynucleotides may be broken down in the stomach, each is preferably administered 
parenterals, including, for example, administration that is subcutaneous, intramuscular, 
intravenous, or intradermal Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants. 
buffers, bacteriostatic compounds and solutes which render the formulation isotonic with 
the bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The 
formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. 

The vaccine formulation of the invention may also include adjuvant systems for 
enhancing the immunogenicity of the formulation. Preferably the adjuvant system 
raises preferentially a TH1 type of response. 

An immune response may be broadly distinguished into two extreme catagories, being a 
humoral or cell mediated immune responses (traditionally characterised by antibody and 
cellular effector mechanisms of protection respectively). These categories of response 
have been termed THl-type responses (cell-mediated response), and TH2-type immune 
responses (humoral response). 

Extreme THl-type immune responses may be characterised by the generation of antigen 
specific, hapiotype restricted cytotoxic T lymphocytes, and natural killer cell responses. 
In mice THl-type responses are often characterised by the generation of antibodies of 
the IgG2a subtype, whilst in the human these correspond to IgGl type antibodies. TH2- 
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type immune responses are characterised by the generation of a broad range of 
immunoglobulin isotypes including in mice IgGl, IgA, and IgM. 

It can be considered that the driving force behind the development of these two types of 
immune responses are cytokines. High levels of THl-type cytokines tend to favour the 
induction of cell mediated immune responses to the given antigen, whilst high levels of 
TH2-type cytokines tend to favour the induction of humoral immune responses to the 
antigen. 

The distinction of TH1 and TH2-type immune responses is not absolute. In reality an 
individual will support an immune response which is described as being predominantly 
TH1 or predominantly TH2. However, it is often convenient to consider the families of 
cytokines in terms of that described in murine CD4 +ve T cell clones by Mosmann and 
Coffman {Mosmann, TR. and Cofftnan, R.L. (1989) TH1 and TH2 cells: different 
patterns oflymphokine secretion lead to different functional properties. Annual Review 
of Immunology, 7, pl45-173). Traditionally, THl-type responses are associated with 
the production of the INF-y and IL-2 cytokines by T-lymphocytes. Other cytokines 
often directly associated with the induction of THl-type immune responses are not 
produced by T-cells, such as IL-12. In contrast TH2- type responses are associated with 
the secretion of IL-4, IL-5, IL-6 and IL-13. 

It is known that certain vaccine adjuvants are particularly suited to the stimulation of 
either TH1 or TH2 - type cytokine responses. Traditionally the best indicators of the 
TH1 :TH2 balance of the immune response after a vaccination or infection includes 
direct measurement of the production of TH1 or TH2 cytokines by T lymphocytes in 
vitro after restimulation with antigen, and/or the measurement of the IgGl:IgG2a ratio 
of antigen specific antibody responses. 
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Thus, a THl-type adjuvant is one which preferentially stimulates isolated T-cell 
populations to produce high levels of THl-type cytokines when re-stimulated with 
antigen in vitro, and promotes development of both CD8^ cytotoxic T lymphocytes and 
antigen specific immunoglobulin responses associated with THl-type isotype. 

Adjuvants which are capable of preferential stimulation of the TH1 cell response are 
described in International Patent Application No. WO 94/00153 and WO 95/17209. 

3 De-O-acylated monophosphoryl lipid A (3D-MPL) is one such adjuvant. This is 
known from GB 222021 1 (Ribi). Chemically it is a mixture of 3 De-O-acylated 
monophosphoryl lipid A with 4 ? 5 or 6 acylated chains and is manufactured by Ribi 
Immunochem, Montana. A preferred form of 3 De-O-acylated monophosphoryl lipid 
A is disclosed in European Patent 0 689 454 Bl (SmithKline Beecham Biologicals SA). 

Preferably, the particles of 3D-MPL are small enough to be sterile filtered through a 
0.22micron membrane (European Patent number 0 689 454). 
3D-MPL will be present in the range of lOfig - lOOjixg preferably 25-50|ag per dose 
wherein the antigen will typically be present in a range 2-50p,g per dose. 

Another preferred adjuvant comprises QS21, an Hplc purified non-toxic fraction derived 
from the bark of Quillaja Saponaria Molina. Optionally this may be admixed with 3 
De-O-acylated monophosphoryl lipid A (3D-MPL), optionally together with an carrier. 

The method of production of QS21 is disclosed in US patent No. 5,057,540. 

Non-reactogenic adjuvant formulations containing QS21 have been described 
previously (WO 96/33739). Such formulations comprising QS21 and cholesterol have 
been shown to be successful TH1 stimulating adjuvants when formulated together with 
an antigen. 
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Further adjuvants which are preferential stimulators of TH1 cell response include 
immunomodulatory oligonucleotides, for example unmethylated CpG sequences as 
disclosed in WO 96/02555. 

Combinations of different TH1 stimulating adjuvants, such as those mentioned 
hereinabove, are also contemplated as providing an adjuvant which is a preferential 
stimulator of TH1 cell response. For example, QS21 can be formulated together with 
3D-MPL. The ratio of QS21 : 3D-MPL will typically be in the order of 1 : 10 to 10 : 1 ; 
preferably 1 :5 to 5 : 1 and often substantially 1:1. The preferred range for optimal 
synergy is 2.5 : 1 to 1 : 1 3D-MPL: QS21. 

Preferably a carrier is also present in the vaccine composition according to the 
invention. The carrier may be an oil in water emulsion, or an aluminium salt, such as 
aluminium phosphate or aluminium hydroxide. 

A preferred oil-in-water emulsion comprises a metabolisible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccine 
composition according to the invention are combined with QS21 and 3D-MPL in such 
an emulsion. Additionally the oil in water emulsion may contain span 85 and/or lecithin 
and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL will be present in a vaccine in 
the range of 1 jag - 200|ag, such as 10-100|ig, preferably 10|ag - 50fag per dose. 
Typically the oil in water will comprise from 2 to 10% squalene, from 2 to 10% alpha 
tocopherol and from 03 to 3% tween 80. Preferably the ratio of squalene: alpha 
tocopherol is equal to or less than 1 as this provides a more stable emulsion. Span 85 
may also be present at a level of 1%. In some cases it may be advantageous that the 
vaccines of the present invention will further contain a stabiliser. 
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Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane or 
squalene, an emulsifier, e.g. Tvveen 80, in an aqueous carrier. The aqueous carrier may 
be, for example, phosphate buffered saline. 

A particularly potent adjuvant formulation involving QS2L 3D-MPL and tocopherol 
in an oil in water emulsion is described in WO 95/17210. 

The present invention also provides a polyvalent vaccine composition comprising a 
vaccine formulation of the invention in combination with other antigens, in particular 
antigens useful for treating cancers, autoimmune diseases and related conditions. Such a 
polyvalent vaccine composition may include a TH-1 inducing adjuvant as hereinbefore 
described. 

While the invention has been described with reference to certain BASB027 polypeptides 
and polynucleotides, it is to be understood that this covers fragments of the naturally 
occurring polypeptides and polynucleotides, and similar polypeptides and polynucleotides 
with additions, deletions or substitutions which do not substantially affect the 
immunogenic properties of the recombinant polypeptides or polynucleotides. 

Compositions, kits and administration 

In a further aspect of the invention there are provided compositions comprising a BASB027 
polynucleotide and/or a BASB027 polypeptide for administration to a cell or to a 
multicellular organism. 

The invention also relates to compositions comprising a polynucleotide and/or a 
polypeptides discussed herein or their agonists or antagonists. The polypeptides and 
polynucleotides of the invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical 
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carrier suitable for administration to an individual. Such compositions comprise, for 
instance, a media additive or a therapeutically effective amount of a polypeptide and/or 
polynucleotide of the invention and a pharmaceutically acceptable carrier or excipient. Such 
carriers may include, but are not limited to. saline, buffered saline, dextrose, water, glycerol, 
ethanol and combinations thereof. The formulation should suit the mode of administration. 
The invention further relates to diagnostic and pharmaceutical packs and kits comprising 
one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides, polynucleotides and other compounds of the invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. 

The pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as 
an injectable composition, for example as a sterile aqueous dispersion, preferably 
isotonic. 

In a further aspect, the present invention provides for pharmaceutical compositions 
comprising a therapeutically effective amount of a polypeptide and/or polynucleotide, such 
as the soluble form of a polypeptide and/or polynucleotide of the present invention, agonist 
or antagonist peptide or small molecule compound, in combination with a pharmaceutically 
acceptable carrier or excipient. Such carriers include, but are not limited to, saline, buffered 
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The invention further 
relates to pharmaceutical packs and kits comprising one or more containers filled with one 
or more of the ingredients of the aforementioned compositions of the invention. 
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Polypeptides, polynucleotides and other compounds of the present invention may be 
employed alone or in conjunction with other compounds, such as therapeutic compounds. 

The composition will be adapted to the route of administration, for instance by a systemic or 
an oral route. Preferred forms of systemic administration include injection, typically by 
intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdermal administration using penetrants such as bile salts or fiisidic 
acids or other detergents. In addition, if a polypeptide or other compounds of the present 
invention can be formulated in an enteric or an encapsulated formulation, oral 
administration may also be possible. Administration of these compounds may also be 
topical and/or localized, in the form of salves, pastes, gels, solutions, powders and the like. 

For administration to mammals, and particularly humans, it is expected that the daily 
dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 
mg/kg. The physician in any event will determine the actual dosage which will be most 
suitable for an individual and will vary with the age, weight and response of the particular 
individual The above dosages are exemplary of the average case. There can, of course, 
be individual instances where higher or lower dosage ranges are merited, and such are 
within the scope of this invention. 

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however, are in the range of 0.1-100 ng/kg of 
subject. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be 
employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 
microgram/kg of antigen, and such dose is preferably administered 1-3 times and with an 
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interval of 1-3 weeks. With the indicated dose range, no adverse toxicological effects will 
be observed with the compounds of the invention which would preclude their 
administration to suitable individuals. 

Wide variations in the needed dosage, however, are to be expected in view of the variety of 
compounds available and the differing efficiencies of various routes of administration. For 
example, oral administration would be expected to require higher dosages than 
administration by intravenous injection. Variations in these dosage levels can be adjusted 
using standard empirical routines for optimization, as is well understood in the art. 

Sequence Databases, Sequences in a Tangible Medium, and Algorithms 

Polynucleotide and polypeptide sequences form a valuable information resource with which 
to determine their 2- and 3-dimensional structures as well as to identify further sequences of 
similar homology. These approaches are most easily facilitated by storing the sequence in a 
computer readable medium and then using the stored data in a known macromolecular 
structure program or to search a sequence database using well known searching tools, such 
as the GCG program package. 

Also provided by the invention are methods for the analysis of character sequences or 
strings, particularly genetic sequences or encoded protein sequences. Preferred methods 
of sequence analysis include, for example, methods of sequence homology analysis, such 
as identity and similarity analysis, DNA, RNA and protein structure analysis, sequence 
assembly, cladistic analysis, sequence motif analysis, open reading frame determination, 
nucleic acid base calling, codon usage analysis, nucleic acid base trimming, and 
sequencing chromatogram peak analysis. 

A computer based method is provided for performing homology identification. This 
method comprises the steps of: providing a first polynucleotide sequence comprising the 
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sequence of a polynucleotide of the invention in a computer readable medium; and 
comparing said first polynucleotide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

A computer based method is also provided for performing homology identification, said 
method comprising the steps of: providing a first polypeptide sequence comprising the 
sequence of a polypeptide of the invention in a computer readable medium; and 
comparing said first polypeptide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 
herein in its entirety in the manner described above for publications and references. 

DEFINITIONS 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
or two or more polynucleotide sequences, as the case may be, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
between strings of such sequences. "Identity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heine, 
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G., Academic Press. 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., 
eds., M Stockton Press. New York. 1991; and Carillo. H., and Lipman, D., SIAM J. 
Applied Math., 48: 1073 (1988). Methods to determine identity are designed to give the 
largest match between the sequences tested. Moreover, methods to determine identity are 
codified in publicly available computer programs. Computer program methods to 
determine identity between two sequences include, but are not limited to, the GAP 
program in the GCG program package (Devereux. J., et al.. Nucleic Acids Research 12(1): 
387 (1984)). BLASTP, BLASTN (Altschul, S.F. et al.. J. Molec. Biol. 215: 403-410 
(1990), and FASTA( Pearson and Lipman Proc. Natl. Acad. Sci. USA 85; 2444-2448 
(1988). The BLAST family of programs is publicly available from NCBI and other 
sources {BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; 
Altschul. S., et al.,J. Mol. Biol. 215: 403-410 (1990). The well known Smith Waterman 
algorithm may also be used to determine identity. 

Parameters for polypeptide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: BLOSSUM62 from Henikoff and Henikoff, 

Proc. Natl. Acad. Sci. USA. 89:10915-10919 (1992) 

Gap Penalty: 8 

Gap Length Penalty: 2 

A program useful with these parameters is publicly available as the "gap" program from 
Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for peptide comparisons (along with no penalty for end gaps). 

Parameters for polynucleotide comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: matches = +10, mismatch = 0 

Gap Penalty: 50 

Gap Length Penalty: 3 



43 



WO 99/63093 



PCT/EP99/03822 



Available as: The "gap" program from Genetics Computer Group, Madison WL These 
are the default parameters for nucleic acid comparisons. 

A preferred meaning for "identity" for polynucleotides and polypeptides, as the case may 
be, are provided in (1) and (2) below. 

(1 ) Polynucleotide embodiments further include an isolated polynucleotide 
comprising a polynucleotide sequence having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 
100% identity to the reference sequence of SEQ ID NO: 1, wherein said polynucleotide 
sequence may be identical to the reference sequence of SEQ ID NO:l or may include up 
to a certain integer number of nucleotide alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 
individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence, and wherein said number of nucleotide 
alterations is determined by multiplying the total number of nucleotides in SEQ ID NO:l 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleotides in SEQ ID NO: 1, or: 

n n < x n - (x n • y), 

wherein n n is the number of nucleotide alterations, x n is the total number of nucleotides 
in SEQ ID NO: 1, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%>, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1 .00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of x n and y is rounded 
down to the nearest integer prior to subtracting it from x n . Alterations of a polynucleotide 
sequence encoding the polypeptide of SEQ ID NO: 2 may create nonsense, missense or 
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frameshift mutations in this coding sequence and thereby alter the polypeptide encoded by 
the polynucleotide following such alterations. 

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ ID NO: 1 , that is it may be 1 00% identical, or it may 
include up to a certain integer number of nucleic acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one nucleic acid deletion, 
substitution, including transition anct transversion, or insertion, and wherein said 
alterations may occur at the 5' or 3' terminal positions of the reference polynucleotide 
sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleic acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of nucleic acid alterations for a given percent 
identity is determined by multiplying the total number of nucleic acids in SEQ ID NO:l 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleic acids in SEQ ID NO: 1 , or: 

n n < x n - (x n • y), 

wherein n n is the number of nucleic acid alterations, x n is the total number of nucleic 
acids in SEQ ID NO:l, y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., • 
is the symbol for the multiplication operator, and wherein any non-integer product of x n 
and y is rounded down to the nearest integer prior to subtracting it from x n . 

(2) Polypeptide embodiments further include an isolated polypeptide comprising a 
polypeptide having at least a 50,60, 70, 80, 85, 90, 95, 97 or 100% identity to a 
polypeptide reference sequence of SEQ ID NO:2, wherein said polypeptide sequence may 
be identical to the reference sequence of SEQ ID NO:2 or may include up to a certain 
integer number of amino acid alterations as compared to the reference sequence, wherein 
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said alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence, and wherein said number of 
amino acid alterations is determined by multiplying the total number of amino acids in 
SEQ ID NO:2 by the integer defining the percent identity divided by 100 and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

n a < x a - (x a • y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino acids 
in SEQ ID NO:2, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0,97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of x a and y is rounded 
down to the nearest integer prior to subtracting it from x a . 

By way of example, a polypeptide sequence of the present invention may be identical to 
the reference sequence of SEQ ID NO:2, that is it may be 100% identical, or it may- 
include up to a certain integer number of amino acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence. The number of amino acid 
alterations for a given % identity is determined by multiplying the total number of amino 
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acids in SEQ ID NO:2 by the integer defining the percent identity divided by 100 and 
then subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

n a < x a - (x a • y), 

wherein n a is the number of amino acid alterations, x a is the total number of amino acids 
in SEQ ID NO:2, y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and • is 
the symbol for the multiplication operator, and wherein any non-integer product of x a and 
y is rounded down to the nearest integer prior to subtracting it from x a . 

"Individual(s), n when used herein with reference to an organism, means a multicellular 
eukaryote, including, but not limited to a metazoan, a mammal an ovid, a bovid. a simian, 
a primate, and a human. 

"Isolated" means altered f, by the hand of man" from its natural state, i.e. 9 if it occurs in 
nature, it has been changed or removed from its original environment, or both. For example, 
a polynucleotide or a polypeptide naturally present in a living organism is not "isolated," but 
the same polynucleotide or polypeptide separated from the coexisting materials of its natural 
state is "isolated", as the term is employed herein. Moreover, a polynucleotide or 
polypeptide that is introduced into an organism by transformation, genetic manipulation or 
by any other recombinant method is "isolated" even if it is still present in said organism, 
which organism may be living or non-living. 

"Polynucleotide^)" generally refers to any polyribonucleotide or polydeoxyribonucleotide, 
which may be unmodified RNA or DNA or modified RNA or DNA including single and 
double-stranded regions. 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
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polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result in amino acid substitutions, additions, deletions, fusions and truncations in 
the polypeptide encoded by the reference sequence, as discussed below. A typical 
variant of a polypeptide differs in amino acid sequence from another, reference 
polypeptide. Generally, differences are limited so that the sequences of the reference 
polypeptide and the variant are closely similar overall and, in many regions, identical. 
A variant and reference polypeptide may differ in amino acid sequence by one or more 
substitutions, additions, deletions in any combination. A substituted or inserted amino 
acid residue may or may not be one encoded by the genetic code. A variant of a 
polynucleotide or polypeptide may be a naturally occurring such as an allelic variant, or 
it may be a variant that is not known to occur naturally. Non-naturally occurring 
variants of polynucleotides and polypeptides may be made by mutagenesis techniques 
or by direct synthesis. 

n Disease(s) n means any disease caused by or related to infection by a bacteria, including, 
for example, otitis media in infants and children, pneumonia in elderlies, sinusitis, 
nosocomial infections and invasive diseases, chronic otitis media with hearing loss, fluid 
accumulation in the middle ear, auditive nerve damage, delayed speech learning, infection 
of the upper respiratory tract and inflammation of the middle ear. 
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EXAMPLES: 

The examples below are carried out using standard techniques, which are well known and 
routine to those of skill in the art, except where otherwise described in detail. The examples 
are illustrative, but do not limit the invention. 

Example 1: 

Discovery and confirmatory DNA sequencing of the BASB027 gene from 
Moraxella catarrhalis strain ATCC 43617. 

The BASB027 gene of SEQ ID NO; 1 was first discovered in the Incyte PathoSeq data 
base containing unfinished genomic DNA sequences of the Moraxella catarrhalis strain 
ATCC 4361 7 (also referred to as strain Mc293 1). The translation of the BASB027 
polynucleotide sequence, shown in SEQ ID NO:2, showed significant similarity (32 % 
identity in a 817 amino acids overlap) to the OMP85 outer membrane protein of 
Neisseria meningitidis. 

The sequence of the BASB027 gene was further confirmed experimentally. For this 
purpose, genomic DNA was extracted from 10 10 cells of the M catarrhalis cells (strain 
ATCC 43617) using the QIAGEN genomic DNA extraction kit (Qiagen Gmbh), and 
1 jig of this material was submitted to Polymerase Chain Reaction DNA amplification 
using primers E515515 (5'- ACT-ATA-GGG-CAC-GCG-TG -3') [SEQ ID NO:5] and 
E515528 : (5*- CCT-GCG-TTT-GTT-TGA-TTG-AG-3 ') [SEQ ID NO:6]. This PCR 
product was purified on a Biorobot 9600 (Qiagen Gmbh) apparatus and subjected to 
DNA sequencing using the Big Dye Cycle Sequencing kit (Perkin-Elmer) and an ABI 
377/PRISM DNA sequencer* DNA sequencing was performed on both strands with a 
redundancy of 2 and the full length sequence was assembled using the SeqMan program 
from the DNASTAR Lasergene software package. The resulting DNA sequence and 
deduced polypeptide sequence are shown as SEQ ID NO:3 and SEQ ID NO:4 
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respectively. Four nucleotide differences distinguish SEQ ID NO:3 from SEQ ID NO:l. 
Using the MEGALIGN program from the DNASTAR Lasergene software package, an 
alignment of the polynucleotide sequences of SEQ ID NO:l and 3 was performed, and 
is displayed in Figure 2; their level of identity was calculated to be 99.8 %. 
Using the same program, an alignment of the polypeptide sequences of SEQ ID NO:2 
and 4 was performed, and is displayed in Figure 3; their level of identity was calculated 
to be 99.8 %. 

Example 2: 

Variability analysis of the BASB027 gene among several Moraxella catarrhal* 
strains. 

2A: Restriction Fragment Length Analysis (RFLP). 

Genomic DNA was extracted from 16 M. catarrhalis strains (presented in Table 1) as 
described below. M. catarrhalis was streaked for single colonies on BHI agar plates 
and grown overnight at 37 °C. Three or four single colonies were picked and used to 
inoculate a -1.5 ml BHI (Brain-heart infusion) broth seed culture which was grown 
overnight in a shaking incubator, -300 rpm, at 37 °C. A 500ml erlenmeyer flask 
containing -150 ml of BHI broth was inoculated with the seed culture and grown for 
-12-16 hours at 37 °C in a shaking incubator, -175 rpm, to generate cell mass for DNA 
isolation. Cells were collected by centrifugation in a Sorvall GSA rotor at -2000 X g 
for 15 minutes at room temperature. The supernatant was removed and the cell pellet 
suspended in -5.0 ml of sterile water. An equal volume of lysis buffer (200 mM NaCl, 
20 mM EDTA, 40 mM Tris-Hcl, pH 8.0, 0.5% (w/v) SDS, 0.5% (v/v) 2- 
mercaptoethanol, and 250 ug/ml of proteinase K) was added and the cells suspended by 
gentle agitation and trituration. The cell suspension was then incubated -12 hours at 
50°C to lyse the bacteria and liberate chromosomal DNA. Proteinaceous material was 
precipitated by the addition of 5.0 ml of saturated NaCl (-6.0 M, in sterile water) and 
centrifugation at ~5,500xg in a Sorvall SS34 rotor at room temperature. Chromosomal 
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DNA was precipitated from the cleared supernatant by the addition of two volumes of 
100 % ethanol. Aggregated DNA was collected and washed using gentle agitation in a 
small volume of a 70 % ethanol solution. Purified chromosomal DNA was suspended 
in sterile water and allowed to dissolve/ disburse overnight at 4 °C by gentle rocking. 
The concentration of dissolved DNA was determined spectrophotometrically at 260 nm 
using an extinction coefficient of 1.0 O.D. unit -50 jug/ml. 

This material was next submitted to PCR amplification using the MC-D15-BamF (5"- 
AAG GGC CCA ATT ACG CAG AGG GGA TCC ACA GGA CTA CAG CGA GTG 
ACC ATT GAA AGC TTA C -3') [SEQ ID NO:7] and MC-D15-SalRC (AAG GGC 
CCA ATT ACG CAG AGG GTC GAC TTA TTA AAA GAC ACT ACC AAT CTG 
GAA CTG TAC CGT ATC G -3') [SEQ ID NO:8] oligonucleotides. The corresponding 
BASB027 gene amplicons were then subjected independantly to hydrolysis using 
restriction enzymes (Acil, Hindlll, Madll, MallL Rsal, SauSAI) and restriction 
products were separated by agarose or poiyacrylamide gel electrophoresis using 
standard molecular biology procedures as described in 44 Molecular Cloning, a 
Laboratory Manual, Second Edition, Eds: Sambrook, Fritsch & Maniatis, Cold Spring 
Harbor press 1989", The photographs of the resulting electrophoresis gels are displayed 
in Figure 1 , For each strain, RFLP patterns corresponding to the 6 restriction enzymes 
were scored and combined. Groups of strains sharing identical combination of RFLP 
patterns were then defined. Using this methodology, the strains tested in this study fell 
into 4 genomic groups ( Group 1 : Mc2906, Mc 2908, Mc2912 ? Mc2926; Group 2 : 
Mc2905, Mc2907, Mc2909, Mc2911, Mc2913, Mc2960, Mc2975 ; Group 3: Mc2910, 
Mc2912, Mc2956, Mc2969; Group 4 : Mc293 1). These data support that the Moraxella 
catarrhalis population used in this study displays limited nucleotide sequence diversity 
for the BASB027 gene. 
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Table 1: Features of the Moraxella catarrhalis strains used in this study 



Strain 


Isolated in: 


from: 


Mc2904 


USA 


Tympanocentesis 


Mc2905 


USA 


Tympanocentesis 


Mc2906 


USA 


Tympanocentesis 


Mc2907 


USA 


Tympanocentesis 


Mc2908 


USA 


Acute otitis Tympanocentesis 


Mc2909 


USA 


Tympanocentesi s 


Mc2910 


USA 


Tympanocentesis 


Mc2911 


USA 


Acute otitis Tympanocentesis 


Mc2912 


USA 


Acute otitis Tympanocentesis 


Mc2913 


USA 


Acute otitis Tympanocentesis 


Mc2926 


USA 


Tympanocentesis 


Mc2931 


USA 


Transtracheal aspirate 


/ATCC 






43617 






Mc2956 


Finland 


Middle ear fluid 


Mc2960 


Finland 


Middle ear fluid 


Mc2969 


Norway 


Nasopharynx (Pharyngitis- 
Rhinitis) 


Mc2975 


Norway 


Nasopharynx (Rhinitis) 
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Example 3: Construction of Plasmid to Express Recombinant BASB027 

A: Cloning of BASB027 . 

The BamHl and Sail restriction sites engineered into the forward ([SEQ ID NO:7J) and 
reverse complementary ([SEQ ID NO:8]) amplification primers, respectively, permitted 
directional cloning of an -2500 bp PCR product into the commercially available E.coli 
expression plasmid pQE30 (QiaGen, ampicillin resistant) such that a mature BASB027 
protein could be expressed as a fusion protein containing a (His)6 affinity 
chromatography tag at the N-terminus. The BASB027 PCR product was purified from 
the amplification reaction using silica gel-based spin columns (QiaGen) according to the 
manufacturers instructions. To produce the required BamHl and Sail termini necessary 
for cloning, purified PCR product was sequentially digested to completion with BamHl 
and Sail restriction enzymes as recommended by the manufacturer (Life Technologies). 
Following the first restriction digestion, the PCR product was purified via spin column 
as above to remove salts and eluted in sterile water prior to the second enzyme 
digestion. The digested DNA fragment was again purified using silica gel-based spin 
columns prior to ligation with the pQE30 plasmid. 

B: Production of Expression Vector, 

To prepare the expression plasmid pQE30 for ligation, it was similarly digested to 
completion with both BamHl and Sail and then treated with calf intestinal phosphatase 
(CIP, -0.02 units / pmole of 5 , end, Life Technologies) as directed by the manufacturer 
to prevent self ligation. An approximately 5-fold molar excess of the digested fragment 
to the prepared vector was used to program the ligation reaction. A standard -20 ^il 
ligation reaction (-1 6°C, -1 6 hours), using methods well known in the art, was 
performed using T4 DNA ligase (-2.0 units / reaction, Life Technologies). An aliquot 
of the ligation (-5 ]x\) was used to transform electro-competent M15(pREP4) cells 
according to methods well known in the art. Following a -2-3 hour outgrowth period at 
37°C in -1.0 ml of LB broth, transformed cells were plated on LB agar plates 
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containing kanamycin (50 fig/ml) and ampiciliin (100 jig/ml). Both antibiotics were 
included in the selection media to ensure that all transformed cells carried both the 
pREP4 plasmid (KnR), which carries the laclq gene necessary for the repression of 
expression for IPTG-inducible expression of proteins on pQE30, and the pQE30- 
BASB027 plasmid (ApR). Plates were incubated overnight at 37°C for -1 6 hours. 
Individual KnR / ApR colonies were picked with sterile toothpicks and used to "patch" 
inoculate fresh LB KnR / ApR plates as well as a ~1 .0 ml LB KnR / ApR broth culture. 
Both the patch plates and the broth culture were incubated overnight at 37°C in either a 
standard incubator (plates) or a shaking water bath. 

A whole cell-based PCR analysis was employed to verify that transformants contained 
the BASB027 DNA insert. Here, the -1 .0 ml overnight LB Kn / Ap broth culture was 
transferred to a 1 .5 ml polypropylene tube and the cells collected by centrifugation in a 
Beckmann microcentrifuge (-3 min., room temperature, -12,000 X g). The cell pellet 
was suspended in ~200]-il of sterile water and a ~10jal aliquot used to program a ~50jLil 
final volume PCR reaction containing both BASB027 forward and reverse amplification 
primers. Final concentrations of the PCR reaction components were essentially the 
same as those specified in example 2 except -5.0 units of Tag polymerase was used. 
The initial 95 °C denaturation step was increased to 3 minutes to ensure thermal 
disruption of the bacterial cells and liberation of plasmid DNA. An ABI Model 9700 
thermal cycler and a 32 cycle, three-step thermal amplification profile, i.e. 95°C, 45sec; 
55-58°C, 45sec, 72°C, Imin., were used to amplify the BASB027 PCR fragment from 
the lysed transformant samples. Following thermal amplification, a ~20jj.l aliquot of the 
reaction was analyzed by agarose gel electrophoresis (0.8 % agarose in a Tris-acetate- 
EDTA (TAE) buffer). DNA fragments were visualized by UV illumination after gel 
electrophoresis and ethidium bromide staining. A DNA molecular size standard (1 Kb 
ladder. Life Technologies) was electrophoresed in parallel with the test samples and was 
used to estimate the size of the PCR products. Transformants that produced the 
expected ~ 2500 bp PCR product were identified as strains containing a B ASB027 
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expression construct. Expression plasmid containing strains were then analyzed for the 
inducible expression of recombinant BASB027. 

C; Expression Analysts of PCR-Positive Transformants. 

For each PCR-positive transformant identified above, -5.0 ml of LB broth containing 
kanamycin (50 ug/ml) and ampicillin (100 ug/ml) was inoculated with cells from the 
patch plate and grown overnight at 37 °C with shaking (-250 rpm). An aliquot of the 
overnight seed culture (-1.0 ml) was inoculated into a 125 ml erlenmeyer flask 
containing -25 ml of LB Kn / Ap broth and grown at 37 °C with shaking (-250 rpm) 
until the culture turbidity reached O.D.600 of -0.5, i.e. mid-log phase (usually about 1.5 
- 2.0 hours). At this time approximately half of the culture (-12.5 mi) was transferred to 
a second 125 ml flask and expression of recombinant BASB027 protein induced by the 
addition of IPTG (1.0 M stock prepared in sterile water, Sigma) to a final concentration 
of 1 .0 mM. Incubation of both the IPTG-induced and non-induced cultures continued 
for an additional -4 hours at 37 °C with shaking. Samples (-1 .0 ml) of both induced 
and non-induced cultures were removed after the induction period and the cells 
collected by centrifugation in a microcentrifuge at room temperature for -3 minutes. 
Individual cell pellets were suspended in ~50ul of sterile water, then mixed with an 
equal volume of 2X Laemelli SDS-PAGE sample buffer containing 2-mercaptoethanol, 
and placed in boiling water bath for -3 min to denature protein. Equal volumes (~15ul) 
of both the crude IPTG-induced and the non-induced cell lysates were loaded onto 
duplicate 12% Tris/glycine polyacrylamide gel (1 mm thick Mini-gels, Novex). The 
induced and non-induced lysate samples were electrophoresed together with prestained 
molecular weight markers (SeeBlue, Novex) under conventional conditions using a 
standard SDS/Tris/glycine running buffer (BioRad). Following electrophoresis, one gel 
was stained with commassie brilliant blue R250 (BioRad) and then destained to 
visualize novel BASB027 IPTG-inducible protein(s). The second gel was electroblotted 
onto a PVDF membrane (0.45 micron pore size, Novex) for -2 hrs at 4 °C using a 
BioRad Mini-Protean II blotting apparatus and Towbin s methanol (20 %) transfer 
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buffer. Blocking of the membrane and antibody incubations were performed according 
to methods well known in the art. A monoclonal anti-RGS (His)3 antibody, followed 
by a second rabbit anti-mouse antibody conjugated to HRP (QiaGen), was used to 
confirm the expression and identity of the BASB027 recombinant protein. 
Visualization of the anti-His antibody reactive pattern was achieved using either an 
ABT insoluble substrate or using Hyperfilm with the Amersham ECL 
chemiluminescence system. 

D: Sequence Confirmation . 

To further verify that the IPTG-inducible recombinant BASB027 protein being 
expressed is in the correct open reading frame and not a spurious molecule arising from 
a cloning artifact (i.e. a frame-shift), the DNA sequence of the cloned insert was 
determined. The DNA sequence for the M.catarrhalis BASB027 gene was obtained 
from one strand using conventional asymmetric PCR cycle sequencing methodologies 
(ABI Prism Dye-Terminator Cycle Sequencing, Perkin-Elmer). Sequencing reactions 
were programmed with undigested expression plasmid DNA (~0.5jig/rxn) as a template 
and appropriate pQE30 vector-specific and ORF-specific sequencing primers (-3.5 
pmol/rxn). In addition to the template and sequencing primer, each sequencing reaction 
(~20|al) contained the four different dNTPs (i.e. A,G,C, and T) and the four 
corresponding ddNTPs (i.e. ddA, ddG, ddC, and ddT) terminator nucleotides; with each 
terminator being conjugated to one of the four fluorescent dyes, Joe, Tarn, Rox, or Fam. 
Single strand sequencing elongation products were terminated at random positions 
along the template by the incorporation of the dye-labelled ddNTP terminators. 
Fluorescent dye-labelled termination products were purified using microcentrifuge size- 
exclusion chromatography columns (Princeton Genetics), dried under vacuum, 
suspended in a Template Resuspension Buffer (Perkin-Elmer) for capillary 
electrophoresis or deionized formamide for PAGE, denatured at 95°C for -5 min, and 
analyzed by high resolution capillary electrophoresis (ABI 310 Automated DNA 
Sequenator, Perkin-Elmer) or high resolution PAGE (ABI 377 Automated DNA 
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Sequenator) as recommended by the manufacturer. DNA sequence data produced from 
individual reactions were collected and the relative fluorescent peak intensities analyzed 
automatically on a PowerMAC computer using ABI Sequence Analysis Software 
(Perkin-Elmer). Individually autoanalyzed DNA sequences were edited manually for 
accuracy before being merged into a consensus single strand sequence string" using 
AutoAssembler software (Perkin-Elmer). Sequencing determined that the expression 
plasmid contained the correct sequence in the correct open reading frame. 

Example 4 : Production of Recombinant BASBQ27 

Bacterial strain 

A recombinant expression strain of £ coli Ml 5 (pREP4) containing a plasmid (pQE30) 
encoding BASB027 from M. catarrhalis. was used to produce cell mass for purification 
of recombinant protein. The expression strain was cultivated on LB agar plates 
containing 50jig/ml kanamycin ("Kn") and 100ng/ml ampicillin ("Ap") to ensure both 
the pREP4 laclq control plasmid and the pQE30-BASB027 expression construct were 
both maintained. For cryopreservation at -80 °C, the strain was propagated in LB broth 
containing the same concentration of antibiotics then mixed with an equal volume of LB 
broth containing 30% (w/v) glycerol. 

Media 

The fermentation medium used for the production of recombinant protein consisted of 
2X YT broth (Difco) containing 50|ig/mi Kn and 100|ng/ml Ap. Antifoam was added to 
medium for the fermentor at 0.25 ml/L (Antifoam 204, Sigma). To induce expression of 
the BASB027 recombinant protein, IPTG (Isopropyl 6-D-Thiogalactopyranoside) was 
added to the fermentor (1 mM, final). 

Fermentation 
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A 500-ml erienmeyer seed flask, containing 50ml working volume, was inoculated with 
0.3 ml of rapidly thawed frozen culture, or several colonies from a selective agar piate 
culture, and incubated for approximately 12 hours at 37 ± 1 °C on a shaking platform at 
150rpm (Innova 2100, New Brunswick Scientific). This seed culture was then used to 
inoculate a 5-L working volume fermentor containing 2X YT broth and both Kn and Ap 
antibiotics. The fermentor (Bioflo 3000, New Brunswick Scientific) was operated at 37 
± 1°C 0.2 - 0.4 VVM air sparge, 250 rpm in Rushton impellers. The pH was not 
controlled in either the flask seed culture or the fermentor. During fermentation, the pH 
ranged 6.5 to 7.3 in the fermentor, IPTG (1.0 M stock, prepared in sterile water) was 
added to the fermentor when the culture reached mid-log of growth (-0.7 O.D.600 
units). Cells were induced for 2 - 4 hours then harvested by centrifugation using either a 
28RS Heraeus (Sepatech) or RC5C superspeed centrifuge (Sorvall Instruments). Cell 
paste was stored at -20 °C until processed. 

Purification 

Chemicals and Materials 

Imidazole, guanidine hydrochloride, Tris (hydroxymethyl), and EDTA (ethylene- 
diamine tetraacetic acid) biotechnology grade or better were all obtained from 
Ameresco Chemical, Solon, Ohio. Triton X-100 (t-Octylphenoxypolyethoxy-ethanol), 
sodium phosphate, monobasic, and Urea were reagent grade or better and obtained from 
Sigma Chemical Company, St. Louis, Missouri. Glacial acetic acid and hydrochloric 
acid were obtained from Mallincrodt Baker Inc., Phillipsburg, New Jersey. Methanol 
was obtained from Fisher Scientific, Fairlawn, New Jersey. Pefabloc®SC (4-(2- 
Aminoethyl)-benzenesulfonylfuoride), Complete protease inhibitor cocktail tablets, and 
PMSF (phenyimethyl-sulfonylfluoride) were obtained from Roche Diagnostics 
Corporation, Indianapolis, Indiana. Bestatin, Pepstatin A, and E-64 protease inhibitor 
were obtained from Calbiochem, LaJolla, California. Dulbecco's Phosphate Buffered 
Saline(lx PBS) was obtained from Quality Biological Inc., Gaithersburg, Maryland. 
Dulbecco's Phosphate Buffered Saline (lOx PBS) was obtained from BioWhittaker, 
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Walkersville. Maryland. Penta-His Antibody. BSA free was obtained from QiaGen, 
Valencia. California. Peroxidase-conjugated AffmiPure Goat Anti-mouse IgG was 
obtained from Jackson Immuno Research. West Grove, Penm AEC single solution was 
obtained from Zymed, South San Francisco. California. All other chemicals were 
reagent grade or better. 

Ni-NTA Superflow resin was obtained from QiaGen Inc., Valencia, California. Precast 
Tris-Glycine 4-20% and 10-20% polyacrylamide gels, all running buffers and solutions, 
SeeBlue Pre-Stained Standards, MultiMark Multi-Colored Standards and PVDF transfer 
membranes were obtained from Novex, San Diego, California. SDS-PAGE Silver Stain 
kits were obtained from Daiichi Pure Chemicals Company Limited, Tokyo, Japan. 
Coomassie Stain Solution was obtained from Bio-Rad Laboratories, Hercules, 
California. Acrodisc® PF 0.2 m syringe filters were obtained from Pall Gelman 
Sciences, Ann Arbor, Michigan. GD/X 25mm disposable syringe filters were obtained 
from Whatman Inc., Clifton, New Jersey. Dialysis tubing 8,000 MWCO was obtained 
from BioDesign Inc. Od New York, Carmal New York. BCA Protein Assay Reagents 
and Snake Skin dialysis tubing 3,500 MWCO were obtained from Pierce Chemical Co., 
Rockford, Illinois. 

Extraction Protocol 

Cell paste was thawed at room temperature for 30 to 60 minutes. Five to six grams of 
material was weighed out into a 50 ml disposable centrifuge tube. To this five 
mis/gram of Guanidine hydrochloride (Gu-HCl) buffer was added (6 M Guanidine 
hydrochloride, 100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05 % Triton 
X-100, pH 8.0). Cell paste was resuspended using a PRO300D proscientific 
homogenizes at 3/4 power for one minute. The extraction mixture was then placed at 
room temperature with gentle agitation for 60 to 90 minutes. After 60 to 90 minutes the 
extraction mixture was centrifuged at 15,800 x g for 15 minutes (Sorvall RC5C 
centrifuge, 1 1,500 rpm). The supernatant (SI) was decanted and saved for additional 
purification. The pellet (PI) was saved for analysis. 
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Binding of BASB027 to Nickel-NTA Resin 

To the SI three to four mis of Ni-NTA resin is added. This is then placed at room 
temperature with gentle agitation for one hour. After one hour the S 1 /Ni-NTA is 
packed into an XK16 Pharmacia column. The column is then washed with 1 M Gu-HCl 
buffer (1 M Guanidine hydrochloride, lOOmM Sodium phosphate, monobasic, 10 mM 
Tris and 0.05% Triton X-100, pH 8.0). This is then followed by a wash with phosphate 
buffer (lOOmM Sodium phosphate, monobasic, 10 mM Tris and 0.05% Triton X-100, 
pH 6.3). The protein is then eluted from the column with a 250 mM imidazole buffer 
(250 mM imidazole, lOOmM Sodium phosphate, monobasic, 10 mM Tris and 0.05% 
Triton X-100, pH 5.9). 

Final Formulation 

BASB027 was formulated by dialysis overnight against, three changes of 0.1 % Triton 
X-100 and lx PBS, pH 7.4, to remove residual Gu-HCI and imidazole. The purified 
protein was characterized and used to produce antibodies as described below. 

Biochemical Characterizations 
SDS-PAGE and Western Blot Analysis 

The recombinant purified protein was resolved on 4-20 % polyacrylamide gels and 
electrophoretically transferred to PVDF membranes at 100 V for 1 hour as previously 
described (Thebaine et al. 1979, Proc. Natl. Acad. Sci. USA 76:4350-4354). The 
PVDF membranes were then pretreated with 25 ml of Dulbecco's phosphate buffered 
saline containing 5 % non-fat dry milk. All subsequent incubations were carried out 
using this pretreatment buffer. 

PVDF membranes were incubated with 25 ml of a 1:500 dilution of preimmune serum 
or rabbit anti-His immune serum for 1 hour at room temperature. PVDF membranes 
were then washed twice with wash buffer (20 mM Tris buffer, pH 7.5, containing 150 
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mM sodium chloride and 0.05 % Tween-20). PVDF membranes were incubated with 
25 ml of a 1:5000 dilution of peroxidase-Iabeled goat anti-rabbit IgG (Jackson 
ImmunoResearch Laboratories, West Grove, PA) for 30 minutes at room temperature. 
PVDF membranes were then washed 4 times with wash buffer, and were developed 
with 3-amino-9-ethylcarbazole and urea peroxide as supplied by Zymed (San Francisco, 
CA) for 10 minutes each. 

The results of an SDS-PAGE (Figure 4) show a protein about 95 kDa that is reactive to 
an anti-RGS(His) antibody by western blots (Figure 5) of the SDS-PAGE. 

Protein Sequencing 

Amino terminal amino acid sequencing of the purified protein was performed to 
confirm the production of the correct recombinant protein using well defined chemical 
protocols on Hewlett-Packard model G1000A sequencer with a model 1090 LC and a 
Hewlett-Packard model 241 sequencer with a model 1 100 LC. 

Example 5 : Production of Antisera to Recombinant BASB027 

Polyvalent antisera directed against the BASB027 protein were generated by 
vaccinating two rabbits with the purified recombinant B ASB027 protein. Each animal 
is given a total of three immunizations intramuscullarly (i.m.) of about 20jig BASB027 
protein per injection (beginning with complete Freund's adjuvant and followed with 
incomplete Freund's adjuvant) at approximately 21 day intervals. Animals were bled 
prior to the first immunization ("pre-bleed") and on days 35 and 57. 

Anti-BASB027 protein titres were measured by an ELISA using purified recombinant 
BASB027 protein (0.5 jig/well). The titre is defined as the highest dilution equal to or 
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greater than 0.1 as calculated with the following equation: average OD of two test 
samples of antisera - the average OD of two test samples of buffer. 
The antisera were used as the first antibody to identify the protein in a western blot as 
described in example 4 above. The western-blot shows the presence of anti-BASB027 
antibody in the sera of immunized animals (Figure 6). 

Example 6 : Immunological Characterisation 

Western Blot Analysis 

Several strains of M catarrhalis were grown on chocolate agar plates for 48 hours at 
35°C in 5% C02. Several colonies were used to inoculate 25ml of Muller Hinton broth 
in a 250 ml flask. Cultures were grown overnight and collected by centrifugation. Cells 
were then solubilized by suspending 30|ag of cells in ISOjliI of PAGE sample buffer 
(360 mM Tris buffer, pH 8,8, containing 4% sodium dodecylsulfate and 20% glycerol), 
and incubating the suspension at 100°C for 5 minutes. The solubilized cells were 
resolved on 4-20% polyacrylamide gels and the separated proteins were 
electrophoretically transferred to PVDF membranes at 100V for lhour as previously 
described (Thebaine et al. 1979, Proc. Natl. Acad Sci. USA 76:4350-4354). The 
PVDF membranes were then pretreated with 25 ml of Dulbecco's phosphate buffered 
saline containing 5 % non-fat dry milk. All subsequent incubations were carried out 
using this pretreatment buffer. 

PVDF membranes were incubated with 25ml of a 1 :500 dilution of preimmune serum or 
rabbit immune serum for lhour at room temperature. PVDF membranes were then 
washed twice with wash buffer (20 mM Tris buffer, pH 7.5, containing 150 mM sodium 
chloride and 0.05% Tween-20). PVDF membranes were incubated with 25ml of a 
1:5000 dilution of peroxidase-labeled goat anti-rabbit IgG (Jackson IrnmunoResearch 
Laboratories, West Grove, PA) for 30 minutes at room temperature. PVDF membranes 
were then washed 4 times with wash buffer, and were developed with 3-amino-9- 
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ethylcarbazole and urea peroxide as supplied by Zymed (San Francisco, CA) for 10 
minutes each. 

A protein of about 95 kDa (corresponding to BASB027 expected molecular weight) that 
is reactive with the antisera is detected in all Moraxella strains (Figure 7). 

Bactericidal Activity 

Complement-mediated cytotoxic activity of anti-BASB027 antibodies was examined to 
determine the vaccine potential of BASB027 polypeptide. Antiserum was prepared as 
described above. The activities of the pre-immune serum and the anti- BASB027 
antiserum in mediating complement killing of M catarrhalis were examined using the 
"Serum Bactericidal Test 1 ' described by Zollinger et al. (Immune Responses to Neisseria 
meningitis, in Manual of Clinical Laboratory Immunology, 3rd ed., pg 347-349), except 
that cells of M catarrhalis strains or cultivars were used instead of Neisseria meningitis 
cells. 

The bactericidal titer of rabbit antiserum (50% killing of homologous strain) was <1 :8 
(pre-immune) and >1:128 (immune). 

Example 7 : Presence of Antibody to BASB027 in Human Convalescent Sera 

Western blot analysis of purified recombinant BASB027 were performed as described 
in Example 4 and 6 above, except that a pool of human sera from children infected by 
M catarrhalis was used as the first antibody preparation. Results show that antisera 
from naturally infected individuals react to the purified recombinant. 

Example 8 : Production of BASB027 peptides, Antisera and Reactivity Thereof 
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Two short amino acid BASB027 specific peptides, having the sequences of 
CYAKPLNKKQNDQTDT (SEQ ID NO:9) and YLTARRGQQTTLGEVVC (SEQ ID 
NO: 10) were produced in the laboratory using generally well known methods. These 
peptides coupled to KLH were used to produce antibodies in 12 weeks old Specific 
Pathogen Free New-Zealand female rabbits. Rabbits received 4 injections at 
approximately 3 weeks intervals of 200 jag of peptide-KLH in complete (1 st injection) or 
incomplete (2 nd , 3 rd and 4 th injections) Freund^s adjuvant. Animals were bled prior to the 
first immunization and one month after the 4 th injection. 

Anti-peptide mid-point titres were measured by an ELISA using free peptides. Anti- 
peptide Mid-point titres one month after the 4 ,[l immunization were superior to 15000. 
Western blots of purified recombinant BASB027, using anti-peptide antibodies as the 
first antibody, were prepared as described in Example 4 and 6. The results are presented 
in Figure 8. 
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Deposited materials 

A deposit containing a Moraxella catarrhalis Catiin strain has been deposited with the American 
Type Culture Collection (herein "ATCC") on June 21, 1997 and assigned deposit number 43617. 
The deposit was described as Branhamella catarrhalis (Frosch and Kolle) and is a freeze-dried, 1 .5- 
2.9 kb insert library constructed from M. catarrhalis isolate obtained from a transtracheal aspirate of 
a coal miner with chronic bronchitits. The deposit is described in Antimicrob. Agents Chemother. 
21: 506-508 (1982). 

The Moraxella catarrhalis strain deposit is referred to herein as "the deposited strain*' or as "the 
DNA of the deposited strain." 

The deposited strain contains a full length BASB027 gene. 

A deposit of the vector pMC-D15 consisting of Moraxella catarrhalis DNA inserted in pQE30 has 
been deposited with the American Type Culture Collection (ATCC) on February 12 1999 and 
assigned deposit number 207105. 

The sequence of the polynucleotides contained in the deposited strain / clone, as well as the amino 
acid sequence of any polypeptide encoded thereby, are controlling in the event of any conflict with 
any description of sequences herein. 

The deposit of the deposited strains have been made under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of Micro-organisms for Purposes of Patent Procedure, The 
deposited strains will be irrevocably and without restriction or condition released to the public upon 
the issuance of a patent. The deposited strains are provided merely as convenience to those of skill 
in the art and are not an admission that a deposit is required for enablement, such as that required 
under 35 U.S.C. §112. 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule \3bts) 

A. The indications made below reiace to the deposited microorganism or other biological material referred to in the description 
on page £S_ _ , hne 1-28 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 
Name of depositary institution 

AMERICAN TYPE CULTURE COLLECTION 

Address of depositary institution (including postal code and country) 

10801 UNIVERSITY BLVD, MANASSAS , VIRGINIA 20110-2209, 
UNITED STATES OF AMERICA 



Dateofdeposit ^ ^ (21-Q6-97) & 

12 February 1999 (12.02,99) 



Accession Number 

43617 & 207105 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



In respect of those designations where a European Patent is sought, a sample 
of the deposited microorganism will be made available until the publication 
of the mention of the grant of the European Patent or until the date on which 
the application has been refused or withdrawn, only by issue of such a sample 
to an expert nominated by the person requesting the sample. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau iater (specify the general nature of 'the indications e.g . "Accession 
Number of Deposit") 



For receiving Office use only 



| yj This sheet was received with the international application 



Authorized officer 






R.L.R. Pether 



For International Bureau use onlv 



|_ j This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 <Julvl99R> 
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CLAIMS; 

1 . An isolated polypeptide comprising an amino acid sequence which has at least 85% 
identity to the amino acid sequence selected from the group consisting of SEQ ID' NO:2 

5 and SEQ ID NO;4, over the entire length of SEQ ID NO;2 or SEQ ID NO:4 resp&rtively. 

2. An isolated polypeptide as claimed in claim 1 in which the amino acid sequence has at 
least 95% identity to the amino acid sequence selected from the group consisting of: SEQ 
ID NO;2 and SEQ ID NO:4, over the entire length of SEQ ID NO:2 or SEQ ID NO;4 

10 respectively. 

Q 3. The polypeptide as claimed in claim 1 comprising the amino acid sequence selected 
;J from the group consisting of SEQ ID NO:2 and SEQ ID NO:4. 

4 15 4. An isolated polypeptide having the amino acid sequence selected from the group 
1 2 consisting of SEQ ID NO:2 or SEQ ID NO:4. 

5. An immunogenic fragment of the polypeptide as claimed in any one of claims 1 to 4 in 
! 0 which the immunogenic fragment is capable of raising an immune response (if necessary 

20 when coupled to a carrier) which recognises the polypeptide of SEQ ID NO:2 or SEQ ID 
M NO:4. 

6. A polypeptide as claimed in any of claims 1 to 5 wherein said polypeptide is part of a 
larger fusion protein. 

25 

7. An isolated polynucleotide encoding a polypeptide as claimed in any of claims 1 to 6. 

8. An isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide that 
has at least 85% identity to the amino acid sequence of SEQ JD NO:2 or 4 over the entire 

30 length of SEQ ID NO:2 or 4 respectively; or a nucleotide sequence complementary to said 
isolated polynucleotide. 
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9, An isolated polynucleotide comprising a nucleotide sequence that has at least 85% identity 
to a nucleotide sequence encoding a polypeptide of SEQ ID NO;2 or 4 over the entire coding 
region; or a nucleotide sequence complementary to said isolated polynucleotide. 

5 

10. An isolated polynucleotide which comprises a nucleotide sequence which has at least 
85% identity to that of SEQ ID NO: 1 or 3 over the entire length of SEQ ID NO: 1 or 3 
respectively; or a nucleotide sequence complementary to said isolated polynucleotide. 

10 11. The isolated polynucleotide as claimed in any one of claims 7 to J 0 in which tiie 
identity is at least 95% to SEQ ID NO: 1 or 3. 

^ 12. An isolated polynucleotide comprising a nucleotide sequence encoding the polypeptide 

3 of SEQ ID NO:2 or SEQ ID NO A 

|'f 13 . An isolated polynucleotide comprising the polynucleotide of SEQ ID NO: 1 or : JEQ ID 
NO:3. 

f U 14, An isolated polynucleotide comprising a nucleotide sequence encoding the polypeptide 
i« 5 20 of SEQ ID NO:2, SEQ ID NO;4 obtainable by screening an appropriate library under 
^ stringent hybridization conditions with a labeled probe having the sequence of SEQ ID NO: 1 
or SEQ ID NO 3 or a fragment thereof 

15. An expression vector or a live microorganism comprising an isolated recombinant 
25 polynucleotide according to any one of claims 7 - 14. 

16. A host cell comprising the expression vector of claim 1 5 expressing an isolated 
polypeptide comprising an amino acid sequence that has at least 85% identity to the amino 
acid sequence selected from the group consisting of SEQ ID NO: 2 and SEQ ID NO:4, or a 

30 membrane of the host cell comprising the expressed polypeptide. 
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17. A process for producing a polypeptide of claims 1 to 6 comprising culturing a host cell 
of claim 16 under conditions sufficient for the production of said polypeptide and 
recovering the polypeptide from the culture medium, 

s 18. A process for expressing a polynucleotide of any one of claims 7-14 comprising 
transforming a host cell with the expression vector comprising at least one of said 
polynucleotides and cuituring said host cell under conditions sufficient for expression of 
any one of said polynucleotides, 

10 1 9. A vaccine composition comprising an effective amount of the polypeptide of iiny one 
of claims 1 to 6 and a pharmaceutical^ acceptable carrier, 

20. A vaccine composition comprising an effective amount of the polynucleotide of any 
one of claims 7 to 14 and a pharmaceutical^ effective carrier. 

15 

21 . The vaccine composition according to either one of claims 19 or 20 wherein sud 
composition comprises at least one other Neisseria meningitidis antigen. 

22. An antibody immunospecific for the polypeptide or immunological fragment as 
20 claimed in any one of claims 1 to 6, 

23. A method of diagnosing a Neisseria meningitidis infection, comprising identifying a 
polypeptide as claimed in any one of claims 1 - 6, or an antibody that is immunosptcific for 
said polypeptide, present within a biological sample from an animal suspected of hsving 

25 such an infection, 

24. Use of a composition comprising an immunologically effective amount of a 
polypeptide as claimed in any one of claims 1 - 6 in the preparation of a medicament for 
use in generating an immune response in an animal. 
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25. Use of a composition comprising an immunologically effective amount of a 
polynucleotide as claimed in any one of claims 7 - 14 in the preparation of a medicament 
for use in generating an immune response in an animal. 

5 26. A therapeutic composition useful in treating humans with Neisseria meningitiiis 
comprising at least one antibody directed against the polypeptide of claims 1 - 6 and a 
suitable pharmaceutical carrier. 



to 




AMENDED SHEET 




SUBSTITUTE SHEET (RULE 2SJ 



09/70171 




SUBSTITUTE SHEET PULE 2S) 




SUBSTITUTE SHEET (RULE 25} 




SUBSTITUTE SWEET {RSJLE 26) 



09/701711 




SUBSTITUTE SHEET (BOLE 26) 




SUBSTITUTE SHEET (RULE 26) 



WO 99/63093 



09/701711 

PCT/EP99/03822 



7/26 




SUB ST1TIITS SHEET (RULE 28) 




SUBSTITUTE SHEET (RULE 26) 




SUBSTITUTE SH8ET (RULE 2S) 



WO 99/63093 



09/701711 

PCT/EP99/03822 



10/26 




SUBSTITUTE SHEET (HOLE 26) 




SUBSTITUTE SHEET (RULE 28) 



09/701711 




SUBSTITUTE SHEET (RULE 26) 



09/701711 



13/26 

Figure 2 : Alignment of the BASB027 polynucleotide sequences. 
Identity to SeqID No:l is indicated by a dot. 

20 * 40 * 
Seqidl : ATGCGTAATTCATATTTTAAAGGTTTTCAGGTCAGTGCAATGACAATGGC : 50 
Seqid3 : : 50 

60 * 80 * 100 
Seqidl : TGTCATGATGGTAATGTCAACTCATGCACAAGCGGCGGATTTTATGGCAA : 100 
Seqid3 : : 100 

* 12G * 140 
Seqidl : AT G AC AT T ACC ATC AC AGG ACT ACAGCGAGTG ACC AT T GAAAGCT T ACAA : 150 
Seqid3 : G : 150 

160 * 180 * 200 
Seqidl : AGCGTGCTGCCGTTTCGCTTGGGTCAAGTGGTGAGCGAAAACCAGTTGGC : 200 
Seqid3 ; GCA : 200 

220 * 240 * 
Seqidl : TGATGGTGTCAAAGCACTTTATGCAACAGGCAATTTTTCAGATGTGCAAG : 250 
Seqid3 : * : 250 

260 * 280 * 300 
Seqidl : TCTATCATCAAGAAGGGCGTATCATCTATCAGGTAACCGAAAGGCCGTTA : 300 
Seqid3 : . : 300 

320 * 340 
Seqidl : ATCGCTGAGATTAATTTTGAGGGCAATCGCTTAATTCCAAAAGAAGGTCT : 350 
Seqid3 : : 350 
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360 * 380 * 400 
Seqidl : ACAAGAAGGGCTAAAAAATGCTGGCTTAGCTGTGGGTCAACCACTAAAAC : 4 00 
Seqid3 : : 400 

+ 420 * 440 

Seqidl : AAGCCACAGTACAGATGATCGAAACCGAGCTTACCAATCAATATATATCA : 4 50 

Seqid3 : : 450 

460 * 480 * 500 
Seqidl : CAAGGCTATTATAATACCGAAATTACTGTCAAACAGACGATGCTTGATGG : 500 
Seqid3 : = 500 

* 520 * 540 * 
Seqidl : TAATCGTGTTAAGCTTGATATGACCTTTGCTGAAGGTAAACCTGCACGGG : 550 
Seqid3 : : 550 

560 + 580 * 600 
Seqidl : TGGTTGATATTAATATCATTGGCAATCAGCATTTTAGCGATGCAGATTTG : 600 
Seqid3 : : 600 

620 * 640 * 

Seqidl : ATTGATGTGCTTGCGATTAAGGATAATAAAATCAATCCACTGTCTAAAGC : 650 

Seqid3 : : 650 

660 * 680 * "700 

Seqidl : TGACCGTTATACTCAAGAAAAGCTGGTGACCAGTTTAGAGAATTTGCGTG : 7 00 

Seqid3 : : 700 



* 720 * 740 

Seqidl : CTAAATATCTCAATGCAGGGTTTGTGCGTTTTGAGATTAAAGATGCTAAG 



750 
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: 750 



760 * 780 * 800 

Seqidl : CTTAATATTAATGAAGATAAAAACCGTATCTTTGTTGAGATTTCATTGCA : 8 00 

c^,^ . : 800 

SeqicJ : * * 



820 * 840 

Seqidl : TGAAGGTGAGCAATATCGCTTTGGACAGACACAGTTTTTGGGTAATTTAA : 8 50 

Seaid3 : : 850 



860 * 880 " 900 

Seqidl : CTTATACTCAAGCAGAACTTGAGGCACTGCTTAAATTCAAAGCAGAAGAA : 900 

Seqid3 : : 900 



* 920 * 940 

Seqidl : GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAA : 950 

Seqid3 : : 950 



960 * 980 * 100° 

Seqidl : ATTTGGTGACGATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCA : 1000 

Seqid3 : : 1000 



★ 1020 * 1040 * 

Seqidl : TTAATGATGAAAGTCGTACGGTTGATGTGGAATATTATATTGACCCTGTA : 1050 

Seqid3 : : 1050 



1060 * 1080 * 1100 
Seqidl : CACCCTGTCTATGTACGCCGTATTAATTTTACAGGTAACTTTAAGACCCA : 1100 
c^^^ . : 1100 
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* 1120 * 1140 * 
Seqidl : AGATGAAGTACTCCGTCGTGAGATGCGACAACTTGAAGGTGCGTTGGCAT : 1150 
Seqid3 : : 1150 

1160 * 1180 * 1200 
Seqidl : CTAATCAAAAAATCCAGCTGTCTCGTGCACGCTTGATGCGGACTGGGTTT : 1200 
Seqid3 : : 1200 

1220 * 1240 * 

Seqidl : TTTAAACATGTTACCGTTGATACTCGTCCAGTACCCAACTCACCTGATCA : 12 50 

Seqid3 : : 1250 

1260 * 1280 * 1300 
Seqidl : GGTTGATGTAAATTTTGTGGTTGAAGAACAACCTTCAGGATCATCAACCA : 1300 
Seqid3 : : 1300 

* 1320 * 1340 * 
Seqidl : TCGCAGCAGGCTACTCTCAAAGTGGTGGTGTAACTTTTCAATTTGATGTT : 135 0 

Seqid3 : : 1350 

1360 * 1380 * 1400 
Seqidl : TCTCAAAATAACTTTATGGGTACAGGTAAGCACGTCAATGCTTCGTTTTC : 14 00 
Seqid3 : : 1400 

* 1420 * 1440 * 
Seqidl : TCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACCAACCCATACT : 14 50 

Seqid3 : : 1450 

1460 * 1480 * 1500 
Seqidl : TTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC : 1500 
Seqid3 : = 1500 
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* 1520 * 1540 
SeqidI : AAGTATGATAACAAGAACATTAGTAATTATGTACTTGATTCTTATGGTGG : 15 50 
Seqid3 : : 1550 



1560 * 1580 * 1600 
SeqidI : CTCATTAAGCTATGGATATCCAATTGATGAAAATCAACGCATAAGCTTTG : 1600 
Seaid3 : : 1600 



* 1620 * 1640 
SeqidI : GTCTGAATGCTGACAATACCAAGCTTCATGGCGGTCGTTTTATGGGCATT : 1650 
Seqid3 : : 1650 



1660 * 1680 * 1700 

SeqidI : AGTAATGTCAAGCAGCTGATGGCAGATGGTGGCAAAATTCAAGTGGATAA : 1700 

Seqid3 : : 1700 



* 1720 * 1740 * 

SeqidI : TAATGGCATTCCTGATTTTAAGCATGATTACACAACCTACAATGCCATTT : 17 50 

Seqid3 : : 1750 



1760 * 1780 * 1800 
SeqidI : TGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC : 18 00 
Seqid3 : : 1800 



* 1820 * 1840 * 

SeqidI : ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCA : 1850 

Seqid3 : : 1850 



I860 * 1880 * 1900 

SeqidI : AAAAGTGGTTTATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAG : 1900 
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SeqicB : : 1900 

1920 - 1940 
Seqidl : T C T T G C G T G G A T A C G C C AAG T TAG G C T AT G G C AA T AAT T T AC C A T T T T AT : 19 50 
Seqid3 : : 1950 

1960 * 1930 - 2000 
Seqidl : GAAAATTTCTATGCAGGCGGCTATGGTTCGGTTCGTGGCTATGATCAATC : 2000 
Seqid3 : : 2000 

2020 * 2040 
Seqidl : CTCTTTGGGTCCACGCTCACAAGCCTATTTGACAGCTCGTCGTGGTCAAC : 2050 
Seqid3 : : 2050 

2060 * 2080 * 2100 
Seqidl : AAACCACACTAGGAGAGGTTGTTGGTGGTAATGCTTTGGCAACTTTCGGC : 2100 
Seqid3 : : 2100 

2120 * 2140 * 
Seqidl : AGTGAGCTGATTTTACCTTTGCCATTTAAAGGTGATTGGATAGATCAGGT : 215 0 
Seqid3 : : 2150 

2160 * 2180 * 2200 
Seqidl : GCGTCCAGTGATATTCATTGAGGGCGGTCAGGTTTTTGATACAACAGGTA : 2200 
Seqid3 : : 2200 

2220 * 2240 
Seqidl : T GG AT AAAC AAAC C AT T G AT T T AAC C C AAT T TAAAG AC CC AC AAG CAAC A : 22 50 
Seqid3 : ; 2250 
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2260 * 22S0 * 230C 
Seqidl : G C T G AAC AAAAT G C AAAAG C AG C C AA7 C GCCCGC? AC T AAC C C AAG AT AA : 23CC 
SeqiciB : . 23CC 

2320 * 2340 
Seqidl : ACAGTTGCGTTATAGTGCTGGTGTTGGTGCAACTTGGTATACGCCCATTG : 2 3 5 G 
Seqid3 : . 2350 

2360 - 2380 * 2400 
Seqidl : G T C C T T T AT CTATTAGCTATG C C AAG C C A T T G AA T AAAAAAC AAAAT GAT : 24 00 
Seqid3 : : 2400 

* 2420 - 2440 
Seqidl : CAGACCGATACGGTACAGTTCCAGATTGGTAGTGTCTTTTAA : 24 42 
Seqid3 : ; 2442 
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Figure 3 : Alignment of the BASB027 polypeptide sequences. 
Identity to SeqID No:2 is indicated by a dot. 

* 20 * 40 

Seqid2 : MRNSYFKGFQVSAMTMAVMMVMSTHAQAADFMANDITITGLQRVTIESLQ : 50 
Seqid4 : & : 50 

60 * 80 100 
Seqid2 : SVLPFRLGQWSENQLADGVKALYATGNFSDVQVYHQEGRII YQVTERPL : 100 
Seqid4 : A : 10Q 

* 120 * 140 * 
Seqid2 : IAEINFEGNRLI FKEGLQEGLKNAGLAVGQPLKQATVQMIETELTNQYIS : 150 
Seqid4 : : 150 

160 * 180 * 200 
Seqid2 : QGYYNTEITVKQTMLDGNRVKLDMTFAEGKPARWDINIIGNQHFSDADL : 200 
Seqid4 : . 200 

* 220 * 240 * 
Seqid2 : IDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAGFVRFEIKDAK : 250 
Seqid4 : . 2 50 

260 * 280 * 300 
Seqid2 : LNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE : 300 
Seqid4 : . 300 

320 * 340 
Seqid2 : GFSQAMLEQTTNNISTKFGDDGYYYAQIRPVTRINDESRTVDVEYYIDPV : 350 
Seqid4 : . 350 
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360 * 380 * 400 

Seqid2 : KPVYVRRINFTGNFKTQDEVLRREMRQLEGALASNQKIQLSRARLMRTGF : 4 00 

Seqid4 : . 4 qq 

420 * 440 

Seqid2 : FKHVTVDTRPVPNSPDQVDVNFVVEEQPSGSSTIAAGYSQSGGVTFQFDV : 4 50 

Seqid4 : : 450 

460 * 480 - 500 

Seqid2 : SQNNFMGTGKHVNASFSRSETREVYSLGMTNPYFTVNGVSQSLSGYYRKT : 500 

Seqid4 : : 50 0 

520 * 540 

Seqid2 : KYDNKNISNYVLDSYGGSLSYGYPI DENQRISFGLNADNTKLHGGRFMGI : 550 

Seqid4 : : 550 

560 * 580 * 600 

Seqid2 : SNVKQLMADGGKIQVDNNGIPDFKHDYTTYNAILGWNYSSLDRPVFPTQG : 600 

Seqid4 : : g 0 0 

* 620 * 640 * 

Seqid2 ; MSHSVDLTVGFGDKTHQKVVYQGNI YRPFIKKSVLRGYAKLGYGNNLPFY ; 650 

Seqid4 : . 650 

660 * 680 * 700 

Seqid2 : ENFYAGGYGSVRGYDQSSLGPRSQAYLTARRGQQTTLGEVVGGNALATFG : 7 00 

Seqid4 : : 70 0 

720 * 740 

Seqid2 : SELILPLPFKGDWIDQVRPVI FIEGGQVFDTTGMDKQTI DLTQFKDPQAT : 750 
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SeqicM : ; 750 

760 - 780 - 80C 
Seqid2 : AEQNAKAANRPLLTQDKQLRYSAGVGATWYTPIGPLS ISYAKPLNKKQNC : 8 00 
SeqicU : : 800 

■*■ 

Seqici2 : QTDTVQFQIGSVF : 813 
Seqid4 : : 813 
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Figure 4: Coomassie stained SDS-PAGE of purified BASB027 protein. 
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Figure S: Western blot with tera-His antibody of purified 8ASB027 protein. 
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Figure 7: Western blot of hole eel! lysates of 16 strains of M camrrhalis using 
pooled sera against the BASB027 protein (sera was diluted 1:2000). 
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Figure 8: Western blot of purified recombinant BASB027 wsfh corresponding anti- 
peptide sera. Lanes 2 and 3 bob immune sera. Lanes 1 aed 4 immune sera. 



MWEVl J 2 3 4 




SUBSTITUTE SHEET (RULE 26} 



09/701711 

99/63093 PCT/EP99/03822 

SEOUENCS LISTING 
<110> SmithKline Beecham. Biologicals 
<12 0> Novel Compounds 

<130> BM45324 
<160> 10 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2442 
<212> DNA 
<213> Bacteria 

<400> 1 

atgcgtaatt catattttaa aggttttcag gtcagtgcaa tgacaatggc tgtcatgatg 60 

gtaatgtcaa ctcatgcaca agcggcggat tttatggcaa atgacattac catcacagga 120 

ctacagcgag tgaccattga aagcttacaa agcgtgctgc cgtttcgctt gggtcaagtg 180 

gtgagcgaaa accagttggc tgatggtgtc aaagcacttt atgcaacagg caatttttca 240 

gatgtgcaag tctatcatca agaagggcgt atcatctatc aggtaaccga aaggccgtta 3 00 

atcgctgaga ttaattttga gggcaatcgc ttaattccaa aagaaggtct acaagaaggg 360 

ctaaaaaatg ctggcttagc tgtgggtcaa ccactaaaac aagccacagt acagatgatc 420 

gaaaccgagc ttaccaatca atatatatca caaggctatt ataataccga aat tactgtc 480 

aaacagacga tgcttgatgg taatcgtgtt aagcttgata tgacctttgc tgaaggtaaa 540 

cctgcacggg tggttgatat taatatcatt ggcaatcagc attttagcga tgcagatttg 600 

attgatgtgc ttgcgattaa ggataataaa atcaatccac tgtctaaagc tgaccgttat 660 

actcaagaaa agctggtgac cagtttagag aatttgcgtg ctaaatatct caatgcaggg 720 

tttgtgcgtt ttgagattaa agatgctaag cttaatatta atgaagataa aaaccgtatc 780 

tttgttgaga tttcattgca tgaaggtgag caatatcgct ttggacagac acagtttttg 840 

ggtaatttaa cttatactca agcagaactt gaggcactgc ttaaattcaa agcagaagaa 900 

gggttttcac aagccatgct tgagcaaaca acaaacaata tcagtaccaa atttggtgac 960 

gatggctatt attatgctca aatccgtcct gtaacacgca ttaatgatga aagtcgtacg 1020 

gttgatgtgg aatattatat tgaccctgta caccctgtct atgtacgccg tattaatttt 1080 

acaggtaact ttaagaccca agatgaagta ctccgtcgtg agatgcgaca acttgaaggt 1140 

gcgttggcat ctaatcaaaa aatccagctg tctcgtgcac gcttgatgcg gactgggttt 1200 

tttaaacatg ttaccgttga tactcgtcca gtacccaact cacctgatca ggttgatgta 1260 

aattttgtgg ttgaagaaca accttcagga tcatcaacca tcgcagcagg ctactctcaa 1320 

agtggtggtg taacttttca atttgatgtt tctcaaaata actttatggg tacaggtaag 13 80 

cacgtcaatg cttcgttttc tcgctctgag acccgtgagg tgtatagttt gggtatgacc 1440 

aacccatact ttaccgtaaa tggcgtctcg caaagcttga gtggctacta tcgtaaaacc 1500 

aagtatgata acaagaacat tagtaattat gtacttgatt cttatggtgg ctcattaagc 1560 

tatggatatc caattgatga aaatcaacgc ataagctttg gtctgaatgc tgacaatacc 1620 

aagcttcatg gcggtcgttt tatgggcatt agtaatgtca agcagctgat ggcagatggt 1680 

ggcaaaattc aagtggataa taatggcatt cctgatttta agcatgatta cacaacctac 1740 

aatgccattt tggggtggaa ttattcaagt ctagatcgcc ctgtatttcc aacccaaggc 1800 

atgagtcatt ctgtagattt gacggttggt tttggtgata aaactcatca aaaagtggtt 1860 

tatcaaggca atatctatcg cccatttatc aaaaaatcag tcttgcgtgg atacgccaag 1920 

ttaggctatg gcaataattt accattttat gaaaatttct atgcaggcgg ctatggttcg 1980 

gttcgtggct atgatcaatc ctctttgggt ccacgctcac aagcctattt gacagctcgt 2040 

cgtggtcaac aaaccacact aggagaggtt gttggtggta atgctttggc aactttcggc 2100 

agtgagctga ttttaccttt gccatttaaa ggtgattgga tagatcaggt gcgtccagtg 2160 

atattcattg agggcggtca ggtttttgat acaacaggta tggataaaca aaccattgat 2220 

ttaacccaat ttaaagaccc acaagcaaca gctgaacaaa atgcaaaagc agccaatcgc 2280 

ccgctactaa cccaagataa acagttgcgt tatagtgctg gtgttggtgc aacttggtat 2340 
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acgcccattg gtcctttatc tattagctat gccaagccat tgaanaaaaa acaaaatgat 
cagaccgata cggtacagtt ccagattggt agtgtctttt aa 

<210> 2 
<211> 813 
<212> PRT 
<213> Bacteria 

<400> 2 

Met Arg Asn Ser Tyr Phe Lys Gly Phe Gin Val Ser Ala Met Thr Met 

15 10 15 

Ala Val Met Met Val Met Ser Thr His Ala Gin Ala Ala Asp Phe Met 

20 25 30 

Ala Asn Asp lie Thr lie Thr Gly Leu Gin Arg Val Thr lie Glu Ser 

35 40 45 

Leu Gin Ser Val Leu Pro Phe Arg Leu Gly Gin Val Val Ser Glu Asn 

50 55 60 

Gin Leu Ala Asp Gly Val Lys Ala Leu Tyr Ala Thr Gly Asn Phe Ser 
65 * 70 75 80 

Asp Val Gin Val Tyr His Gin Glu Gly Arg lie lie Tyr Gin Val Thr 

85 90 95 

Glu Arg Pro Leu lie Ala Glu lie Asn Phe Glu Gly Asn Arg Leu He 

100 105 110 

Pro Lys Glu Gly Leu Gin Glu Gly Leu Lys Asn Ala Gly Leu Ala Val 

115 120 125 

Gly Gin Pro Leu Lys Gin Ala Thr Val Gin Met lie Glu Thr Glu Leu 

130 135 140 

Thr Asn Gin Tyr He Ser Gin Gly Tyr Tyr Asn Thr Glu He Thr Val 
145 150 155 ISO 

Lys Gin Thr Met Leu Asp Gly Asn Arg Val Lys Leu Asp Met Thr Phe 

165 170 175 

Ala Glu Gly Lys Pro Ala Arg Val Val Asp He Asn He He Gly Asn 

180 185 190 

Gin His Phe Ser Asp Ala Asp Leu He Asp Val Leu Ala He Lys Asp 

195 200 205 

Asn Lys He Asn Pro Leu Ser Lys Ala Asp Arg Tyr Thr Gin Glu Lys 

210 215 220 

Leu Val Thr Ser Leu Glu Asn Leu Arg Ala Lys Tyr Leu Asn Ala Gly 
225 230 235 240 

Phe Val Arg Phe Glu He Lys Asp Ala Lys Leu Asn He Asn Glu Asp 

245 250 255 

Lys Asn Arg He Phe Val Glu He Ser Leu His Glu Gly Glu Gin Tyr 

260 265 270 

Arg Phe Gly Gin Thr Gin Phe Leu Gly Asn Leu Thr Tyr Thr Gin Ala 

275 280 285 

Glu Leu Glu Ala Leu Leu Lys Phe Lys Ala Glu Glu Gly Phe Ser Gin 

290 295 300 

Ala Met Leu Glu Gin Thr Thr Asn Asn He Ser Thr Lys Phe Gly Asp 
305 310 315 320 

Asp Gly Tyr Tyr Tyr Ala Gin He Arg Pro Val Thr Arg He Asn Asp 

325 330 335 

Glu Ser Arg Thr Val Asp Val Glu Tyr Tyr He Asp Pro Val His Pro 

340 345 350 

Val Tyr Val Arg Arg He Asn Phe Thr Gly Asn Phe Lys Thr Gin Asp 

355 360 365 

Glu Val Leu Arg Arg Glu Met Arg Gin Leu Glu Gly Ala Leu Ala Ser 

370 375 380 

Asn Gin Lys He Gin Leu Ser Arg Ala Arg Leu Met Arg Thr Gly Phe 



24C0 
2442 
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385 390 395 400 

p he L ys His Val Thr Val Asp Thr Arg Pro Val Pro Asn Ser Pro Asp 

405 410 415 

Gin Val Asp Val Asn Phe Val Val Glu Glu Gin Pro Ser Gly Ser Ser 

42 0 42 5 43 C 

Thr He Ala Ala Gly Tyr Ser Gin Ser Gly Gly Val Thr Pne Gin Phe 

435 440 445 

Asp Val Ser Gin Asn Asn Phe Met Gly Thr Gly Lys His Val Asn Ala 

450 455 460 

Ser Phe Ser Arg Ser Glu Tr.r Arg Glu Val Tyr Ser Leu Gly Met Thr 
46 S 470 475 480 

Asn Pro Tyr Phe Thr Val Asn Gly Val Ser Gin Ser Leu Ser Gly Tyr 

485 490 495 

Tyr Arg Lys Thr Lys Tyr Asp Asn Lys Asn He Ser Asn Tyr Val Leu 

500 505 510 

Asp Ser Tyr Gly Gly Ser Leu Ser Tyr Gly Tyr Pro He Asp Glu Asn 

515 520 525 

Gin Arg He Ser Phe Gly Leu Asn Ala Asp Asn Thr Lys Leu His Gly 

530 535 540 

Gly Arg Phe Met Gly He Ser Asn Val Lys Gin Leu Met Ala Asp Gly 
545 550 555 560 

Gly Lys He Gin Val Asp Asn Asn Gly He Pro Asp Phe Lys His Asp 

565 570 575 

Tyr Thr Thr Tyr Asn Ala He Leu Gly Trp Asn Tyr Ser Ser Leu Asp 

580 585 590 

Arg Pro Val Phe Pro Thr Gin Gly Met Ser His Ser Val Asp Leu Thr 

595 600 605 

Val Gly Phe Gly Asp Lys Thr His Gin Lys Val Val Tyr Gin Gly Asn 

610 615 620 

He Tyr Arg Pro Phe He Lys Lys Ser Val Leu Arg Gly Tyr Ala Lys 
625 630 635 640 

Leu Gly Tyr Gly Asn Asn Leu Pro Phe Tyr Glu Asn Phe Tyr Ala Gly 

645 650 655 

Gly Tyr Gly Ser Val Arg Gly Tyr Asp Gin Ser Ser Leu Gly Pro Arg 

660 665 670 

Ser Gin Ala Tyr Leu Thr Ala Arg Arg Gly Gin Gin Thr Thr Leu Gly 

675 680 685 

Glu Val Val Gly Gly Asn Ala Leu Ala Thr Phe Gly Ser Glu Leu He 

690 695 700 

Leu Pro Leu Pro Phe Lys Gly Asp Trp He Asp Gin Val Arg Pro Val 
705 710 715 7 20 

He Phe He Glu Gly Gly Gin Val Phe Asp Thr Thr Gly Met Asp Lys 

725 730 735 

Gin Thr He Asp Leu Thr Gin Phe Lys Asp Pro Gin Ala Thr Ala Glu 

740 745 750 

Gin Asn Ala Lys Ala Ala Asn Arg Pro Leu Leu Thr Gin Asp Lys Gin 

755 760 765 

Leu Arg Tyr Ser Ala Gly Val Gly Ala Thr Trp Tyr Thr Pro He Gly 

770 775 780 

Pro Leu Ser He Ser Tyr Ala Lys Pro Leu Asn Lys Lys Gin Asn Asp 
785 790 795 800 

Gin Thr Asp Thr Val Gin Phe Gin He Gly Ser Val Phe 
805 810 



<210> 3 
<211> 2442 
<212> DNA 
<213> Bacteria 
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<40G> 3 

atgcgcaat: catattttaa aggttttcag gtcagtgcaa tgacaatggc tgtcatgatg 
gtaatgccaa ctcatgcaca agcggcggat tttatggcaa atgacattgc catcacagga 
ctacagcgag tgaccattga aagcttacaa agcgtgctgc cgtttcgctt gggtcaagtg 
gtgagcgaag cacagttggc tgatggtgtc aaagcacttt atgcaacagg caatttttca 
gatgtgcaag tctatcatca agaagggcgt atcatctatc aggtaaccga aaggccgtta 
atcgccgaga ttaattttga gggcaatcgc ttaattccaa aagaaggtct acaagaaggg 
ccaaaaaatg ctggcttagc tgtgggtcaa ccactaaaac aagccacagt acagatgatc 
gaaaccgagc ttaccaatca atatatatca caaggctatt ataataccga aattactgtc 
aaacagacga tgcttgatgg taatcgtgtt aagcttgata tgacctttgc tgaaggtaaa 
cctgcacggg tggttgatat taatatcatt ggcaatcagc attttagcga tgcagatttg 
attgatgtgc ttgcgattaa ggataataaa atcaatccac tgtctaaagc tgaccgttat 
actcaagaaa agctggtgac cagtttagag aatttgcgtg ctaaatatct caatgcaggg 
tttgtgcgtt ttgagattaa agatgctaag cttaatatta atgaagataa aaaccgtatc /au 
tttgttgaga tttcattgca tgaaggtgag caatatcgct ttggacagac acagtttttg 340 
ggtaatttaa cttatactca agcagaactt gaggcactgc ttaaattcaa agcagaagaa 90 
gggttttcac aagccatgct tgagcaaaca acaaacaata tcagtaccaa atttggtgac 
gatggctatt attatgctca aatccgtcct gtaacacgca ttaatgatga aagtcgtacg 
gttgatgtgg aatattatat tgaccctgta caccctgtct atgtacgccg tattaatttt 1080 
acaggtaact ttaagaccca agatgaagta ctccgtcgtg agatgcgaca acttgaaggt ^ ^r, 
gcgttggcat ctaatcaaaa aatccagctg tctcgtgcac gcttgatgcg gactgggttt 
tttaaacatg ttaccgttga tactcgtcca gtacccaact cacctgatca ggttgatgta 
aatttcgtgg ttgaagaaca accttcagga tcatcaacca tcgcagcagg ctactctcaa 
agtggtggtg caacttttca atttgatgtt tctcaaaata actttatggg tacaggtaag 
cacgtcaatg cttcgttttc tcgctctgag acccgtgagg tgtatagttt gggtatgacc 
aacccatact ttaccgtaaa tggcgtctcg caaagcttga gtggctacta tcgtaaaacc 
aagtatgata acaagaacat tagtaattat gtacttgatt cttatggtgg ctcattaagc 
tatggatatc caattgatga aaatcaacgc ataagctttg gtctgaatgc tgacaatacc 
aagcttcatg gcggtcgttt tatgggcatt agtaatgtca agcagctgat ggcagatggt 
ggcaaaattc aagtggataa taatggcatt cctgatttta agcatgatta cacaacctac 
aatgccattt tggggtggaa ttattcaagt ctagatcgcc ctgtatttcc aacccaaggc 
atgagtcatt ctgtagattt gacggttggt tttggtgata aaactcatca aaaagtggtt 
tatcaaggca atatctatcg cccatttatc aaaaaatcag tcttgcgtgg atacgccaag 
ttaggctatg gcaataattt accattttat gaaaatttct atgcaggcgg ctatggttcg 
gttcgtggct atgatcaatc ctctttgggt ccacgctcac aagcctattt gacagctcgt 
cgtggtcaac aaaccacact aggagaggtt gttggtggta atgctttggc aactttcggc 
agtgagctga ttttaccttt gccatttaaa ggtgattgga tagatcaggt gcgtccagtg 
atattcattg agggcggtca ggtttttgat acaacaggta tggataaaca aaccattgat 
ttaacccaat ttaaagaccc acaagcaaca gctgaacaaa atgcaaaagc agccaatcgc 
ccgctactaa cccaagataa acagttgcgt tatagtgctg gtgttggtgc aacttggtat 
acgcccattg gtcctttatc tattagctat gccaagccat tgaataaaaa acaaaatgat 
cagaccgata cggtacagtt ccagattggt agtgtctttt aa 



60 
12C 
18G 
240 
300 
360 
420 
480 
540 
600 
660 
72C 
780 



960 
1020 



114C 
1200 
1260 
1320 
1330 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2442 



<210> 4 
<211> 813 
<212=> PRT 
<213> Bacteria 
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Gly 


Phe 


Gin 


Val 


Ser 


Ala 


Met 


Thr 


Met 


1 






5 










10 










15 




Ala 


Val 


Met 


Met 


val 


Met 


Ser 


Thr 


His 


Ala 


Gin 


Ala 


Ala 


Asp 


Phe 


Met 








20 










25 










30 






Ala 


Asn 


Asp 


lie 


Ala 


He 


Thr 


Gly 


Leu 


Gin 


Arg 


Val 


Thr 


He 


Glu 


Ser 






35 










40 










45 






Ala 


Leu 


Gin 


Ser 


Val 


Leu 


Pro 


Phe 


Arg 


Leu 


Gly 


Gin 


Val 


Val 


Ser 


Glu 



50 55 60 
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Gin Leu Ala Asp Gly Val Lys Ala Leu Tyr Ala Thr Gly Asn Phe Ser 
65 70 75 80 

Asp Val Gin Val Tyr His Gin Glu Gly Arg lie He Tyr Gin Val Thr 

85 90 95 

Glu Arg Pro Leu He Ala Glu He Asn Phe Glu Gly Asn Arg Leu He 

100 105 HO 

Pro Lys Glu Gly Leu Gin Glu Gly Leu Lys Asn Ala Gly Leu Ala Val 

115 120 125 

Gly Gin Pro Leu Lys Gin Ala Thr Val Gin Met He Glu Thr Glu Leu 

130 135 140 

Thr Asn Gin Tyr He Ser Gin Gly Tyr Tyr Asn Thr Glu He Thr Val 
145 150 155 160 

Lys Gin Thr Met Leu Asp Gly Asn Arg Val Lys Leu Asp Met Thr Phe 

165 170 175 

Ala Glu Gly Lys Pro Ala Arg Val Val Asp lie Asn He lie Gly Asn 

180 135 130 

Gin Kis Phe Ser Asp Ala Asp Leu He Asp Val Leu Ala He Lys Asp 

195 200 205 

Asn Lys He Asn Pro Leu Ser Lys Ala Asp Arg Tyr Thr Gin Glu Lys 

210 215 220 

Leu Val Thr Ser Leu Glu Asn Leu Arg Ala Lys Tyr Leu Asn Ala Gly 
225 230 235 240 

Phe Val Arg Phe Glu He Lys Asp Ala Lys Leu Asn He Asn Glu Asp 

245 250 255 

Lys Asn Arg He Phe Val Glu He Ser Leu His Glu Gly Glu Gin Tyr 

260 265 270 

Arg Phe Gly Gin Thr Gin Phe Leu Gly Asn Leu Thr Tyr Thr Gin Ala 

275 280 285 

Glu Leu Glu Ala Leu Leu Lys Phe Lys Ala Glu Glu Gly Phe Ser Gin 

290 295 300 

Ala Met Leu Glu Gin Thr Thr Asn Asn He Ser Thr Lys Phe Gly Asp 
305 310 315 320 

Asp Gly Tyr Tyr Tyr Ala Gin He Arg Pro Val Thr Arg lie Asn Asp 

325 330 335 

Glu Ser Arg Thr Val Asp Val Glu Tyr Tyr He Asp Pro Val His Pro 

340 345 350 

Val Tyr Val Arg Arg He Asn Phe Thr Gly Asn Phe Lys Thr Gin Asp 

355 360 365 

Glu Val Leu Arg Arg Glu Met Arg Gin Leu Glu Gly Ala Leu Ala Ser 

370 375 380 

Asn Gin Lys He Gin Leu Ser Arg Ala Arg Leu Met Arg Thr Gly Phe 
385 390 395 400 

Phe Lys His Val Thr Val Asp Thr Arg Pro Val Pro Asn Ser Pro Asp 

405 410 415 

Gin Val Asp Val Asn Phe Val Val Glu Glu Gin Pro Ser Gly Ser Ser 

420 425 430 

Thr He Ala Ala Gly Tyr Ser Gin Ser Gly Gly Val Thr Phe Gin Phe 

435 440 445 

Asp Val Ser Gin Asn Asn Phe Met Gly Thr Gly Lys His Val Asn Ala 

450 455 460 

Ser Phe Ser Arg Ser Glu Thr Arg Glu Val Tyr Ser Leu Gly Met Th 
465 470 475 480 

Asn Pro Tyr Phe Thr Val Asn Gly Val Ser Gin Ser Leu Ser Gly Tyr 

485 490 495 

Tyr Arg Lys Thr Lys Tyr Asp Asn Lys Asn He Ser Asn Tyr Val Leu 

500 505 510 

Asp Ser Tyr Gly Gly Ser Leu Ser Tyr Gly Tyr Pro He Asp Glu Asn 
515 520 525 



.r 
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Gin Arg He Ser Phe Gly Leu Asn Ala Asp Asn Thr Lys Leu His Gly 

530 535 540 

Gly Arg Phe Met Gly lie Ser Asn Val Lys Gin Leu Met Ala Asp Gly 
545 550 555 560 

Gly Lys lie Gin Val Asp Asn Asn Gly He Pro Asp Phe Lys His Asp 

565 570 575 

Tyr Thr Thr Tyr Asn Ala He Leu Gly Trp Asn Tyr Ser Ser Leu Asp 

580 585 590 

Arg Pro Val Phe Pro Thr Gin Gly Met Ser His Ser Val Asp Leu Thr 

595 600 605 

Val Gly Phe Gly Asp Lys Thr His Gin Lys Val Val Tyr Gin Gly Asn 

610 615 620 

He Tyr Arg Pro Phe He Lys Lys Ser Val Leu Arg Gly Tyr Ala Lys 
625 630 635 640 

Leu Gly Tyr Gly Asn Asn Leu Pro Phe Tyr Glu Asn Phe Tyr Ala Gly 

645 650 655 

Gly Tyr Gly Ser Val Arg Gly Tyr Asp Gin Ser Ser Leu Gly Pro Arg 

660 665 670 

Ser Gin Ala Tyr Leu Thr Ala Arg Arg Gly Gin Gin Thr Thr Leu Gly 

675 660 685 

Glu Val Val Gly Gly Asn Ala Leu Ala Thr Phe Gly Ser Glu Leu He 

690 695 700 

Leu Pro Leu Pro Phe Lys Gly Asp Trp He Asp Gin Val Arg Pro Val 
705 710 715 V20 

He Phe He Glu Gly Gly Gin Val Phe Asp Thr Thr Gly Met Asp Lys 

725 730 735 

Gin Thr He Asp Leu Thr Gin Phe Lys Asp Pro Gin Ala Thr Ala Glu 

740 745 750 

Gin Asn Ala Lys Ala Ala Asn Arg Pro Leu Leu Thr Gin Asp Lys Gin 

755 760 765 

Leu Arg Tyr Ser Ala Gly Val Gly Ala Thr Trp Tyr Thr Pro He Gly 

770 775 780 

Pro Leu Ser He Ser Tyr Ala Lys Pro Leu Asn Lys Lys Gin Asn Asp 
785 790 795 800 

Gin Thr Asp Thr Val Gin Phe Gin He Gly Ser Val Phe 
805 810 

<210> 5 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 5 

actatagggc acgcgtg 17 

<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 6 
cctgcgtttg tttgattgag 



2G 
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<210> 7 
<211> 61 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 7 

aagggcccaa ttacgcagag gggatccaca ggactacagc gagtgaccat tgaaagctta 60 
c 61 

<210> 8 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 8 

aagggcccaa ttacgcagag ggtcgactts ttaaaagaca ctaccaatct ggaactgtac 60 
cgtatcg 67 

<210> 9 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Oligopeptide 
<400> 9 

Cys Tyr Ala Lys Pro Leu Asn Lys Lys Gin Asn Asp Gin Thr Asp Thr 
15 10 15 

<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Oligopeptide 
<400> 10 

Tyr Leu Thr Ala Arg Arg Gly Gin Gin Thr Thr Leu Gly Glu Val Val 

15 10 15 

Cys 
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SEQUENCE INFORMATION 

BASB027 Polynucleotide and Polypeptide Sequences 



SEQ ID NO:l 

Moraxella catarrhatis BASB027 polynucleotide sequence from strain ATCC 43617 



ATGCGTAATTCATATTTTAAAGGTTTTCAGGTCAGTGCAATGACAATGGCTGTCATGATG 
GTAATGTCAACTCATGCACAAGCGGCGGATTTTATGGCAZVATGACATTACCATCACAGGA 
CTACAGCGAGTGACCATTGAAAGCTTACAAAGCGTGCTGCCGTTTCGCTTGGGTCAAGTG 
GTGAGCGAAAACCAGTTGGCTGATGGTGTCAAAGCACTTTATGCAACAGGCAATTTTTCA 
GATGTGCAAGTCTAT CAT CAAGAAGGGCGTATC AT CTATCAGGT AAC CGAAAGG CCGTTA 
ATCGCTGAGATTAATTTTGAGGGCAATCGCTTAATTCCAAAAGAAGGTCTACAAGAAGGG 
CTAAAAAATGCTGGCTTAGCTGTGGGTCAACCACTAAAACAAGCCACAGTACAGATGATC 
GAAAC CG AG CTT AC CAAT CAAT AT AT AT C AC AAGGCT ATT AT AAT AC CG AAATTACTGTC 
AAACAGACGATGCTTGATGGTAATCGTGTTAAGCTTGATATGACCTTTGCTGAAGGTAAA 
CCTGCACGGGTGGTTGATATTAATATCATTGGCAATCAGCATTTTAGCGATGCAGATTTG 
ATTGATGTGCTTGCGATTAAGGATAATAAAATCAATCCACTGTCTAAAGCTGACCGTTAT 
ACTCAAGAAAAGCTGGTGACCAGTTTAGAGAATTTGCGTGCTAAATATCTCAATGCAGGG 
TTTGTG CGTTTTGAGATT AAAGATG CTAAGCTTAATATTAATGAAGATAAAAACCGTATC 
TTTGTTGAGATTTCATTGCATGAAGGTGAGCAATATCGCTTTGGACAGACACAGTTTTTG 
GGTAATTTAACTTAT ACT CAAGCAGAACTTGAGGCACTGCTTAAATT CAAAGCAGAAGAA 
GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAAATTTGGTGAC 
GATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCATTAATGATGAAAGTCGTACG 
GTTGATGTGGAATATTATATTGACCCTGTACACCCTGTCTATGTACGCCGTATTAATTTT 
ACAGGTAACTTTAAGACCCAAGATGAAGTACTCCGTCGTGAGATGCGACAACTTGAAGGT 
GCGTTGGCATCTAATCAAAAAATCCAGCTGTCTCGTGCACGCTTGATGCGGACTGGGTTT 
TTTAAACATGTTACCGTTGATACTCGTCCAGTACCCAACTCACCTGATCAGGTTGATGTA 
AATTTTGTGGTTGAAGAACAACCTTCAGGATCATCAACCATCGCAGCAGGCTACTCTCAA 
AGTGGTGGTGTAACTTTTCAATTTGATGTTTCTCAAAATAACTTTATGGGTACAGGTAAG 
CACGTCAATGCTTCGTTTTCTCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACC 
AACCCATACTTTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC 
AAGTATGATAACAAGAACATTAGTAATTATGTACTTGATTCTTATGGTGGCTCATTAAGC 
TATGGATATCCAATTGATGAAAATCAACGCATAAGCTTTGGTCTGAATGCTGACAATACC 
AAGCTTCATGGCGGTCGTTTTATGGGCATTAGTAATGTCAAGCAGCTGATGGCAGATGGT 
GGCAAAATTCAAGTGGATAATAATGGCATTCCTGATTTTAAGCATGATTACACAACCTAC 
AATGCCATTTTGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC 
ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCAAAAAGTGGTT 
TATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAGTCTTGCGTGGATACGCCAAG 
TTAGGCTATGGCAATAATTTACCATTTTATGAAAATTTCTATGCAGGCGGCTATGGTTCG 
GTTCGTGGCTATGATCAATCCTCTTTGGGTCCACGCTCACAAGCCTATTTGACAGCTCGT 
CGTGGTCAACAAACCACACTAGGAGAGGTTGTTGGTGGTAATGCTTTGGCAACTTTCGGC 
AGTGAGCTGATTTTACCTTTGCCATTTAAAGGTGATTGGATAGATCAGGTGCGTCCAGTG 
ATATTCATTGAGGGCGGTCAGGTTTTTGATACAACAGGTATGGATAAACAAACCATTGAT 
TTAACCCAATTTAAAGACCCACAAGCAACAGCTGAACAAAATGCAAAAGCAGCCAATCGC 
CCGCTACTAACCCAAGATAAACAGTTGCGTTATAGTGCTGGTGTTGGTGCAACTTGGTAT 
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ACGCC C ATTGGTCCTTTAT CTATT AGCT ATG C C AAG C CATTGAAT AAAAAAC AAAATG AT 
CAGACCGATACGGTACAGTTCCAGATTGGTAGTGTCTTTTAA 

SEQ ID NO:2 

Moraxella catarrhalis BASB027 polypeptide sequence deduced from the 
polynucleotide sequence of SEQ ID NO:l 

MRNSYFKGFQVSAMTMAVTVIIV^ 

VSENQLADGVKALYATGNFSDVQVYHQEGRIIYQVTERPLIAEINFEGNRLIPKEGLQEG 
L KNAGLAVGQPLKQATVQM I ETELTNQ Y I S QG Y YNTE ITVKQTMLDGNRVKLDMTFAEGK 
PARWDINI IGNQHFSDADLIDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAG 
FVRFEIKDAKLNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE 
G F S Q AMLE QTTNN I S T KFGDDG Y Y Y AQ I RP VTR I ND E S RT VDVE Y Y I D P VH P VYVRR INF 
TGNFKTQDEVLRREMRQLEGAIiASNQKIQLSRARLMRTGFFKHVTVDTRPVPNSPDQVDV 
NFVVEEQ PS GS ST I AAGYSQSGGVTFQFDVSQNNFMGTGKHVNASFSRSETREVYSLGMT 
NPYFTVNGVSQSLSGYYRKTKYDNKNISNYVLDSYGGSLSYGYPIDENQRISFGLNADNT 
KLHGGR FMG I S NVKQLMADGGKI QVDNNG I PD F KHDYTTYNAI LGWNY SSLDRPVFPTQG 
MS HS VDLTVGFGD KTHQKWYQGNI YRP F I KKS VLRG YAKLG YGNNL P F YENF Y AGG YG S 
VRGYDQSSLGPRSQAYLTARRGQQTTLGEWGGNALATFGSELILPLPFKGDWIDQVRPV 
I F I EGGQVFDTTGMDKQTIDLTQFKD PQATAEQNAKAANRPLLTQDKQLRYS AGVGATWY 
TPIGPLSISYAKPLNKKQNDQTDTVQFQIGSVF 



SEQ ID NO:3 

Moraxella catarrhalis BASB027 polynucleotide sequence from strain ATCC 43617 



ATGCGTAATT CATATTTTAAAGGTTTTC AGGT CAGTGCAATGACAATGGCTGTCATGATG 
GTAATGTCAACTCATGCACAAGCGGCGGATTTTATGGCAAATGACATTACCATCACAGGA 
CT AC AG CG AGTG ACC ATTGAAAGCTT AC AAAG CGTG CTG C CGTTTCG CTTGGGT C AAGTG 
GTGAGCGAAAACCAGTTGGCTGATGGTGTCAAAGCACTTTATGCAACAGGCAATTTTTCA 
GATGTGCAAGTCTATCATCAAGAAGGGCGTATCATCTATCAGGTAACCGAAAGGCCGTTA 
ATCGCTGAGATT AATTTTGAGGGCAAT CGCTTAATT C CAAAAGAAGGT CTAC AAG AAGGG 
CTAAAAAATGCTGGCTTAGCTGTGGGTCAACCACTAAAACAAGCCACAGTACAGATGATC 
G AAAC CG AG CTT ACCAATC AAT AT AT AT C AC AAGG CT ATT AT AATACCG AAATT AC TGT C 
AAACAGACGATGCTTGATGGTAATCGTGTTAAGCTTGATATGACCTTTGCTGAAGGTAAA 
CCTGCACGGGTGGTTGATATTAATATCATTGGCAATCAGCATTTTAGCGATGCAGATTTG 
ATTGATGTGCTTGCGATTAAGGATAATAAAATCAATCCACTGTCTAAAGCTGACCGTTAT 
ACT CAAGAAAAGCTGGTGACCAGTTTAGAGAATTTG CGTG CTAAATATCTCAATGCAGGG 
TTTGTGCGTTTTGAGATTAAAGATGCTAAGCTTAATATTAATGAAGATAAAAACCGTATC 
TTTGTTGAGATTTCATTGCATGAAGGTGAGCAATATCGCTTTGGACAGACACAGTTTTTG 
GGTAATTTAACTTATACTCAAGCAGAACTTGAGGCACTGCTTAAATTCAAAGCAGAAGAA 
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GGGTTTTCACAAGCCATGCTTGAGCAAACAACAAACAATATCAGTACCAAATTTGGTGAC 
GATGGCTATTATTATGCTCAAATCCGTCCTGTAACACGCATTAATGATGAAAGTCGTACG 
GTTGATGTGGAATATTATATTGACCCTGTACACCCTGTCTATGTACGCCGTATTAATTTT 
ACAGGTAACTTTAAGACCCAAGATGAAGTACTCCGTCGTGAGATGCGACAACTTGAAGGT 
GCGTTGG CATCTAATCAAAAAATCCAGCTGTCTCGTGCACGCTTGATGCGGACTGGGTTT 
TTTAAACATGTTACCGTTGATACTCGTCCAGTACCCAACTCACCTGATCAGGTTGATGTA 
AATTTTGTGGTTGAAGAACAACCTTCAGGATCATCAACCATCGCAGCAGGCTACTCTCAA 
AGTGGTGGTGTAACTTTTCAATTTGATGTTTCTCAAAATAACTTTATGGGTACAGGTAAG 
CACGTCAATGCTTCGTTTTCTCGCTCTGAGACCCGTGAGGTGTATAGTTTGGGTATGACC 
AACCCATACTTTACCGTAAATGGCGTCTCGCAAAGCTTGAGTGGCTACTATCGTAAAACC 
AAGTATGATAACAAGAACATTAGTAATTATGTACTTGATTCTTATGGTGGCTCATTAAGC 
T ATGGAT AT C C AATTG ATGAAAAT C AACG C AT AAGCTTTGGT CTGAATG CT GACAAT AC C 
AAGCTTCATGGCGGTCGTTTTATGGGCATTAGTAATGTCAAGCAGCTGATGGCAGATGGT 
GGC AAAATT CAAGT GG AT AAT AATGG C ATT CCTGATTTTAAGC AT G ATT AC AC AACCT AC 
AATGCCATTTTGGGGTGGAATTATTCAAGTCTAGATCGCCCTGTATTTCCAACCCAAGGC 
ATGAGTCATTCTGTAGATTTGACGGTTGGTTTTGGTGATAAAACTCATCAAAAAGTGGTT 
TATCAAGGCAATATCTATCGCCCATTTATCAAAAAATCAGTCTTGCGTGGATACGCCAAG 
TT AGG CT ATGGC AAT AATTT ACC ATTTTATG AAAATTT CTATG C AGG CGG CT ATGGTT CG 
GTTCGTGGCTATGATCAATCCTCTTTGGGTCCACGCTCACAAGCCTATTTGACAGCTCGT 
CGTGGTCAACAAACCACACTAGGAGAGGTTGTTGGTGGTAATGCTTTGGCAACTTTCGGC 
AGTGAGCTGATTTTACCTTTGCCATTTAAAGGTGATTGGATAGATCAGGTGCGTCCAGTG 
ATATTCATTGAGGGCGGTCAGGTTTTTGATACAACAGGTATGGATAAACAAACCATTGAT 
TTAACCCAATTTAAAGACCCACAAGCAACAGCTGAACAAAATGCAAAAGCAGCCAATCGC 
C CG CT ACT AACCCAAG AT AAACAGTTGCGTT AT AGTGCTGGTGTTGGTGCAACTTGGT AT 
ACGCCCATTGGTCCTTTATCTATTAGCTATGCCAAGCCATTGAATAAAAAACAAAATGAT 
C AG ACCGAT ACGGT AC AGTT C C AGATTGGT AGTGT CTTTTAA 



SEQ ID NO:4 

Moraxella catarrhalis BASB027 polypeptide sequence deduced from the 
polynucleotide sequence of SEQ ID NO:3 

MRNSYFKGFQVSAMT^VMWMSTHAQAADFMANDITITGLQRVTIESLQSVLPFRLGQV 
VSENQIiADGVKALYATGNFSDVQVYHQEGRI IYQVTERPLIAEINFEGNRLI PKEGLQEG 
LKNAGLAVGQPLKQATVQMI ETELTNQYI SQGYYNTEITVKQTMLDGNRVKLDMTFAEGK 
PARWDINIIGNQHFSDADLIDVLAIKDNKINPLSKADRYTQEKLVTSLENLRAKYLNAG 
FVRFEIKDAKLNINEDKNRIFVEISLHEGEQYRFGQTQFLGNLTYTQAELEALLKFKAEE 
GFSQAMLEQTTNNISTKFGDDGYYYAQIRPVTRINDESRTVDVEYYIDPVHPVYVRRINF 
TGNFKTQDEVLRREMRQLEGALASNQKIQLSRARLMRTGFFKHVTVDTRPVPNSPDQVDV 
NFWEEQPSGSSTIAAGYSQSGGVTFQFDVSQNNFMGTGKHVNASFSRSETREVYSLGMT 
NPYFTVNGVSQSLiSGYYRKTKYDNKNI SNYVLDS YGGSLS YGYP I DENQR I S FGLNADNT 
KLHGGRFMG I SNVKQLM ADGGK I QVDNNG I PDF KHD YTTYNAI LGWNY SSLDR PVF PTQG 
MSHSVDLTVGFGDKTHQKWYQGNIYRPFIKKSVLRGYAKLGYGNNLPFYENFYAGGYGS 
VRGYDQSSLGPRSQAYLTARRGQQTTLGEWGGNALATFGSELILPLPFKGDWIDQVRPV 
IFIEGGQVFDTTGMDKQTIDLTQFKDPQATAEQNAKAANRPLLTQDKQLRYSAGVGATWY 
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TP IGPLS I SYAKPLNKKQNDQTDTVQFQIGS VF 



SEQ ID NO: 5 

ACT ATA GGG CAC GCG TG 

SEQ ID NO: 6 

CCT GCG TTT GTT TGA TTG AG 

SEQ ID NO: 7 

AAG GGC CCA ATT ACG CAG AGG GGA TCC ACA GGA CTA CAG CGA GTG 

ACC ATT GAA AGC TTA C 

SEQ ID NO: 8 

AAG GGC CCA ATT ACG CAG AGG GTC GAC TTA TTA AAA GAC ACT ACC 
AAT CTG GAA CTG TAC CGT ATC G 

SEQ ID NO: 9 

CYAKPLNKKQNDQTDT 
SEQ ID NO: 10 

YLTARRGQQTTLGEWC 
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